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Sir : 

As authorized by the inventor (s), transmitted herewith for 
filing is a patent application applied for on behalf of the 
inventor (s) according to the provisions of 37 C.F.R. § 1.41(c), 
which claims priority under 35 U.S.C. § 119(e) of Provisional 
Application No. 60/160,767 filed on October 21, 1999. 



Inventor (s): Nickolai ALEXANDROV, Maxim TROUKHAN 



For: 



SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING 
POLYPEPTIDES ENCODED THEREBY 



Enclosed are: 



□ 



A specification consisting of Description (94 pages), Table 1 
(4 pages). Claims (3 pages). Schematic (1 page), and an 
Abstract (1 page) totaling one-hundred and three (103) pages 



( 



) sheet (s) of formal drawings 



Certified copy of Priority Document (s) 

Executed Declaration in accordance with 37 C.F.R. § 1,64 will 
follow 



A statement to establish small entity status under 37 C.F.R. 
§ 1.9 and 37 C.F.R. § 1.27 



Mail Addrkss; P.O. Box 747, Falls Church, Viroinia* USA 22040-0747 



"1 
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□ Preliminary Amenciment 
3 Information Sheet 

□ Information Disclosure Statement, PTO-1449 and reference (s) 

□ Amend the specification by inserting before the first line 
the sentence: 

This application claims priority on provisional Application 
No. filed on , the entire contents of which are 

hereby incorporated by reference. — 

El Two identical CD-Rs created using IBM-PC, MS-Windows 
operating system. The files contained on the CD-R are as 
follows : 



File Name 


Date of Creation 


File Size 


001018 Protein Domain Table 


October 19, 2000 


2.05 MB 



□ Amend the specification by inserting before the first line 
the sentence: 

— This application contains a CDR, the entire contents of 
which are hereby incorporated by reference. The CDR 
contains the following files: 



File Name 


Date of Creation 


File Size 









^ other: Power of Attorney regarding Small Entity Statement, 
ATCC Deposit receipts PTA-595, PTA-1161, PTA-1411 

The filing fee has been calculated as shown below: 







LARGE ENTITY 


SMALL ENTITY 




BASIC 


FEE 


$690.00 


$345.00 




NUMBER 
FILED 


NUMBER 
EXTRA 


RATE FEE 


RATE FEE 


TOTAL 
CLAIMS 


20- 20 = 


0 


X 18 = $0.00 


X 9 = $0.00 


INDEPENDENT 
CLAIMS 


3- 3 = 


0 


X 78 = $0.00 


X 39 = $0.00 


MULTIPLE DEPENDENT 
1 □ CLAIMS PRESENTED 


+ $260.00 


+ $130.00 




TOTAL 


$0.00 


$345.00 
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^ The application transmitted herewith is filed in accordance 
with 37 C.F.R. § 1.41(c). The undersigned has been authorized 
by the inventor (s) to file the present application. The 
original duly executed declaration together with the 
surcharge will be forwarded in due course. 

3 A check in the amount of $345.00 to cover the filing fee is 
enclosed. 

□ Please charge Deposit Account No. 50-1055 in the amount of 
$0.00 to cover the filing fee. A triplicate copy of this 
transmittal form is enclosed. 



3 Please send correspondence to: 

BIRCH, STEWART, KOLASCH & BIRCH, LLP - Customer No . 2292 

P.O. Box 747 

Falls Church, VA 22040-0747 
Telephone: (703) 205-8000 

If the Primary Deposit Account No. 50-1055 is deficient and 
non-payment will result in a loss of rights, the Commissioner is 
hereby authorized in this, concurrent and future replies, to 
charge payment or credit any overpayment to Deposit Account No. 
02-2448 for any additional fees required under 37 C.F.R. § 1.16 or 
under 37 C.F.R. § 1.17; particularly, extension of time fees. 

Respectfully submitted, 

BIRCH, STEWART, KOLASCH & BIRCH, LLP 




R^mofcd C. Stewart, #21,066 



P.O. Box 747 

RCS/CAV Falls Church, VA 22040-0747 

2 750-1300P (703) 2 05-8000 

Attachments 

(Rev, 04/19/2000) 
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'statement claiming small entity status Docket Number: 2750-130QP 

(37 CFR 1.9(f) & 1.27(c)) - SMALL BUSINESS CONCERN 

Applicant, Patentee, or Identifier: N. ALEXANDRQV et al, . 



Application or Patent No.: NEW Patent Application 
Filed or Issued: October 20, 2000 



Title: SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDES 
ENCODED THEREBY — 

I hereby state that I am 

□ the owner of the small business concern identified below: 
3 an official of the small business concern empowered to act on behalf of 
the concern identified below: 

NAME OF SMALL BUSINESS CONCERN CERES, INC. 



ADDRESS OF SMALL BUSINESS CONCERN 3007 Malibu Canyon Road Malibu, CA 90265 — 

I hereby state that the above identified small business concern qualifies as a small 
business concern as defined in 37 CFR Part 121 for purposes of paying reduced fees to the United 
States Patent and Trademark Office, in that the number of employees of the concern, including 
those of its affiliates, does not exceed 500 persons. For purposes of this statement, (1) the 
number of employees of the business concern is the average over the previous fiscal year of the 
concern of the persons employed on a full-time, part-time, or temporary basis during each ot J:tie 
pay periods of the fiscal year, and (2) concerns are affiliates of each other when either, 
directly or indirectly, one concern controls or has the power to control the other, or a third 
party or parties controls or has the power to control both. 

I hereby state that rights under contract or law have been conveyed to and remain with 
the small business concern identified above with regard to the invention described m: 



the specification filed herewith with title as listed above. 
^ the application identified above. 
^ the patent identified above. 

If the rights held by the above identified small business concern are not 
exclusive, each individual, concern, or organization having rights in the invention 
must file separate statements as to their status as small entities, and no rights ^ to 
the invention are held by any person, other than the inventor, who would not qualify 
as an independent inventor under 37 CFR 1.9(c) if that person made the invention, or 
by any concern which would not qualify as a small business concern under 37 CFR 
1.9(d), or a nonprofit organization under 37 CFR 1.9(e). 

Each person, concern, or organization having any rights in the 
invention is listed below: 

3 no such person, concern, or organization exists. 

□ each such person, concern, or organization is listed below. 

Separate statements are required from each named person, concern, _ or 
organization having rights to the invention stating their status as small entities. 
(37 CFR 1.27) 

I acknowledge the duty to file, in this application or patent, notification of 
any change in status resulting in loss of entitlement to small entity status prior to 
paying, or at the time of paying, the earliest of the issue fee or any maintenance fee 
due after the date on which status as a small entity is not longer appropriate. (37 
CFR 1.28 (b) ) 

NAME OF PERSON SIGNING Raymond C. Stewart (Reg. No. 21, 066) . 



TITLE IN ORGANIZATION OF PERSON SIGNING Legal Representative of CERES, INC. 

ADDRESS OF PERSON SIGNING Birch, Stewart, Kolasch and Birch, LLP. 

P.O. Box 747 Falls Church, VA 22040-0747 




SIGNATURE ^l^^'^^^>^Mf>^..^y;^ — ^^^^^H^^^^X^^^^lT^ DATE October 20, 2000 

Rev. 10/12/1998 



PATENT 
2750-1300P 



IN THE U.S. PATENT AND TRADEMARK OFFICE 



Applicant : 



Appl. No. : 



NEW 



Nickolai ALEXANDROV and Maxim TROUKHAN 



INFORMATION 



SHEET 



O 




Filed: 



October 20, 2000 



For: 



SEQUENCE-DETERMINED DNA FRAGMENTS AND 
CORRESPONDING POLYPEPTIDES ENCODED 
THEREBY 



Priority Claimed: 2750-590P 60/160,767 October 21, 1999 



Send Correspondence to: 

BIRCH, STEWART, KOLASCH & BIRCH, LLP or CUSTOMER NO. 2292 
P.O. Box 747 

Falls Church, VA 22040-0747 
(703) 205-8000 



The above information is submitted to advise the Q.S.P.T.O. 
of all relevant facts in connection with the present application. 

A timely executed Declaration in accordance with 37 C.F.R, 
§ 1.64 will follow. 



Respectfully submitted. 



BIRCH, STEWART, KOLASCH & BIRCH, LLP 




RCS/CAV 
2750-1300P 



P.O. Box 747 

Falls Church, VA 22040-0747 
(703) 205-8000 



(Rev. 04/19/2000) 
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SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING 
POLYPEPTIDES ENCODED THEREBY 

This application claims priority under 35 use § 119(e), §119(a-d)and §120 of the 
following applications, the entire contents of which are hereby incorporated by reference: 



Country 


Filing Date 


Attorney No. 


Client No. 


Application No. 


United States 


10/21/1999 


2750-590P 


80150.002 


60/160,767 



This application contains a CDR, the entire contents of which are hereby incorporated 
by reference. The CDR contains the following files: 



File Name 


Date of Creation 


File Size 


001018 Protein Domain Table 


October 19, 2000 


2.05 MB 



FIELD OF THE INVENTION 

The present invention relates to isolated polynucleotides that represent a complete 
gene, or a fragment thereof, that is expressed. In addition, the present invention relates to the 
polypeptide or protein corresponding to the coding sequence of these polynucleotides. The 
present invention also relates to isolated polynucleotides that represent regulatory regions of 
genes. The present invention also relates to isolated polynucleotides that represent 
untranslated regions of genes. The present invention further relates to the use of these isolated 
polynucleotides and polypeptides and proteins. 

DESCRIPTION OF THE RELATED ART 

Efforts to map and sequence the genome of a number of organisms are in progress; a few 
complete genome sequences, for example those of £. coli and Saccharomyces cerevisiae are 
known (Blattner et aL, Science 277:1453 (1997); Goffeau et al.. Science 274:546 (1996)). The 
complete genome of a multicellular organism, C elegans, has also been sequenced (See, the C 
elegans Sequencing Consortium, Science 282:2012 (1998)). To date, no complete genome of a 
plant has been sequenced, nor has a complete cDNA complement of any plant been sequenced. 

SUMMARY OF THE INVENTION 

The present invention comprises polynucleotides, such as complete cDNA sequences 
and/or sequences of genomic DNA encompassing complete genes, fragments of genes, and/or 
regulatory elements of genes and/or regions with other functions and/or intergenic regions, 
hereinafter collectively referred to as Sequence-Determined DNA Fragments (SDFs), from 
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different plant species^ particularly com, wheat, soybean, rice mdArabidopsis thaliana^ and 
other plants and or mutants, variants, fragments or fusions of said SDFs and polypeptides or 
proteins derived therefrom. In some instances, the SDFs span the entirety of a protein-coding 
segment. In some instances, the entirety of an mRNA is represented. Other objects of the 
5 invention that are also represented by SDFs of the invention are control sequences, such as, but 
not limited to, promoters. Complements of any sequence of the invention are also considered 
part of the invention. 

Other objects of the invention are polynucleotides comprising exon sequences, 
polynucleotides comprising intron sequences, polynucleotides comprising introns together with 
1 0 exons, intron/exon junction sequences, 5' untranslated sequences, and 3' untranslated sequences 
of the SDFs of the present invention. Polynucleotides representing the joinder of any exons 
described herein, in any arrangement, for example, to produce a sequence encoding any 
f % desirable amino acid sequence are within the scope of the invention. 

The present invention also resides in probes useful for isolating and identifying nucleic 
>fl 1 5 acids that hybridize to an SDF of the iiwention. The probes can be of any length, but more 
1 1 typically are 12-2000 nucleotides in length; more typically, 15 to 200 nucleotides long; even 

^ more typically, 18 to 100 nucleotides long. 

Yet another object of the invention is a method of isolating and/or identifying nucleic 
I^^r acids using the following steps: 

n J 2 0 (a) contacting a probe of the instant invention with a polynucleotide sample under 

f% conditions that permit hybridization and formation of a polynucleotide duplex; and 

(b) detecting and/or isolating the duplex of step (a). 

The conditions for hybridization can be from low to moderate to high stringency 
conditions. The sample can include a polynucleotide having a sequence unique in a plant 

2 5 genome. Probes and methods of the invention are useful, for example, without limitation, for 
mapping of genetic traits and/or for positional cloning of a desired fragment of genomic DNA. 

Probes and methods of the invention can also be used for detecting alternatively spliced 
messages within a species. Probes and methods of the invention can further be used to detect or 
isolate related genes in other plant species using genomic DNA (gDNA) and/or cDNA libraries. 

30 In some instances, especially when longer probes and low to moderate stringency hybridization 
conditions are used; the probe will hybridize to a plurality of cDNA and/or gDNA sequences of 
a plant. This approach is useful for isolating representatives of gene families which are 
identifiable by possession of a common functional domain in the gene product or which have 
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common cis-acting regulatory sequences. This approach is also useful for identifying 
orthologous genes from other organisms. 

The present invention also resides in constructs for modulating the expression of the 
genes comprised of all or a fragment of an SDF. The constructs comprise all or a fragment of 
the expressed SDF, or of a complementary sequence. Examples of constructs include 
ribozymes comprising RNA encoded by an SDF or by a sequence complementary thereto, 
antisense constructs, constructs comprising coding regions or parts thereof, constructs 
comprising promoters, introns, untranslated regions, scaffold attachment regions, methylating 
regions, enhancing or reducing regions, DNA and chromatin conformation modifying 
sequences, etc. Such constructs can be constructed using viral, plasmid, bacterial artificial 
chromosomes (BACs), plasmid artificial chromosomes (PACs), autonomous plant plasmids, 
plant artificial chromosomes or other types of vectors and exist in the plant as autonomous 
replicating sequences or as DNA integrated into the genome. When inserted into a host cell 
the construct is, preferably, functionally integrated with, or operatively linked to, a 
heterologous polynucleotide. For instance, a coding region from an SDF might be operably 
linked to a promoter that is functional in a plant. 

The present invention also resides in host cells, including bacterial or yeast cells or plant 
cells, and plants that harbor constructs such as described above. Another aspect of the invention 
relates to methods for modulating expression of specific genes in plants by expression of the 
coding sequence of the constructs, by regulation of expression of one or more endogenous genes 
in a plant or by suppression of expression of the polynucleotides of the invention in a plant. 
Methods of modulation of gene expression include without limitation (1) inserting into a host 
cell additional copies of a polynucleotide comprising a coding sequence; (2) modulating an 
endogenous promoter in a host cell; (3) inserting antisense or ribozyme constructs into a host 
cell and (4) inserting into a host cell a polynucleotide comprising a sequence encoding a variant 
, fragment, or fusion of the native polypeptides of the instant invention. 

BRIEF DESCRIPTION OF THE TABLES 

In TABLE 1, the format of the data is as follows: 

In Table 1, sequence data are presented in the form of annotation of a reference 
sequence. The format is shown below. The reference sequence is shown at the top of the 
annotation file as a 7 digit sequence number preceded by >" (e.g. >5019261). The sequence 
identifier is a gi" number that identifies a specific DNA sequence in the publically 
accessible BLAST Databases on the NCBI FTP web site (accessible at ncbi.nlm.gov/blast). 



Attorney No. 2750-1300P 

4 

In particular, the nt.Z" nucleotide sequence data base at the NCBI FTP site utilizes the gi" 
identifiers to assign by NCBI a unique identifier for each sequence in the databases, thereby 
providing a non-redundant database for sequences from various data bases, including 
GenBank, EMBL, DDBJ (DNA Database of Japan) and PDB (Brookhaven Protein Data 
Bank). Thus, the line in TABLE 1 beginning with sequence number identifies the unique gi" 
identifier followed by the corresponding GenBank (gb) accession number and locus. The gi 
identifier number is followed by a /" and if present a number cataloging a corresponding 
clone identifier number, which denotes a cDNA sequence that aligns with the genomic DNA 
sequence. The reference sequence number is followed on the next line by data regarding the 
length of the sequence ( len") and the number of exons found in the sequence by the analysis 
program ( nex"). 

The annotation data are presented in columns; the leftmost column identifies the position of 
the putative exon in the gene as initial ( init"), internal ( intr") or terminal ( term"). Genes 
considered composed of a single exon are denoted sngl". The next column describes the 
position in the nucleotide sequence beginning the exon ( start") and the next column 
describes the position in the nucleotide sequence ending the exon ( stop"). The direction of 
the gene is indicated in the next column, +" indicating 5' - 3' in the direction presented in 
the database, - indicating the opposite orientation. The gene number" is given in the final 
column. Exons having the same gene number are grouped in the order shown to create the 
relevant coding sequence. 
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>5019261 <^ This is the gi number of the public sequence 

len = 97208 nex = 121 

ft fr 

Length Number exons 

of public sequence 



Exon 


Start 


Stop 


Type 








1} 
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Sngl 
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-? z. o y 
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i ^ w o 


1 3 R Q 
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In 1 1- 


2 1 4 7 


9 1 R Q 

Z ± O O y 


Tnt T 
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ii. X o z 


TezTTi 
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234 88 


9 3 S (^7 
o u u / 
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25035 


il- ^ X O 


Intr 


25466 


25589 
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25677 


25786 


Intr 


25899 


25962 


Intr 


26045 


26109 


Intr 


26188 


26253 


Term 


26350 


26448 


Sngl 


27671 


27793 


Sngl 


29126 


29299 


Sngl 


30266 


30364 


Sngl 


31717 


31929 


Sngl 


32102 


32209 


Sngl 


32450 


32548 



Direction Gene 

Number 
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X 


_L 

i 


z 


X 

r 
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15 



Sngl 


32634 


32726 


+ 


20 


Init 


35603 


35743 


+ 


21 


Term 


35829 


36185 


+ 


21 


Init 


36954 


37098 




22 


Term 


38100 


38158 




22 


Init 


39635 


39944 


+ 


23 
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40242 


40372 


+ 


23 


Intr 


40462 


40695 
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23 
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40815 


41070 
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23 


Intr 


41176 


41255 
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42212 


42419 




23 


Intr 


42940 
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23 



DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to (I) polynucleotides and methods of use thereof, such as 
lA. Probes, Primers and Substrates; 

2 0 IB. Methods of Detection and Isolation; 

B . 1 . Hybridization; 

B.2. Methods of Mapping; 

B.3. Southern Blotting; 

B. 4. Isolating cDNA from Related Organisms; 

25 B .5 . Isolating and/or Identifying Orthologous Genes 

IC. Methods of Inhibiting Gene Expression 

C. l. Antisense 

C.2. Ribozyme Constructs; 
C.3. Chimeraplasts; 

3 0 C.4 Co-Suppression; 

C.5, Transcriptional Silencing 

C.6. Other Methods to Inhibit Gene Expression 

ID. Methods of Functional Analysis; 

IE. Promoter Sequences and Their Use; 

3 5 IF, UTRs and/or Intron Sequences and Their Use; and 
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IG. Coding Sequences and Their Use. 

The invention also relates to (II) polypeptides and proteins and methods of use thereof, 
such as IIA. Native Polypeptides and Proteins 

A.l Antibodies 

A. 2 In Vitro Applications 

IIB. Polypeptide Variants, Fragments and Fusions 

B. l Variants 
B.2 Fragments 
B.3 Fusions 

The invention also includes (III) methods of modulating polypeptide production, such as 

IIIA. Suppression 

A.l Antisense 
A.2 Ribozymes 
A.3 Co-suppression 

A.4 Insertion of Sequences into the Gene to be Modulated 
A.5 Promoter Modulation 

A. 6 Expression of Genes containing Dominant-Negative Mutations 

IIIB. Enhanced Expression 

B. l Insertion of an Exogenous Gene 
B.2 Promoter Modulation 

The invention further concems (IV) gene constructs and vector construction, such as 
IVA. Coding Sequences 
IVB. Promoters 
IVC. Signal Peptides 

The invention still further relates to 
V Transformation Techniques 



Definitions 
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Allelic variant An allelic variant" is an alternative form of the same SDF, which 

resides at the same chromosomal locus in the organism. Allelic variations can occur in any 
portion of the gene sequence, including regulatory regions. Allelic variants can arise by 
normal genetic variation in a population. Allelic variants can also be produced by genetic 
engineering methods. An allelic variant can be one that is found in a naturally occurring 
plant, including a cultivar or ecotype. An allelic variant may or may not give rise to a 
phenotypic change, and may or may not be expressed. An allele can result in a detectable 
change in the phenotype of the trait represented by the locus. A phenotypically silent allele 
can give rise to a product. 

Alternatively spliced messages Within the context of the current invention, 
alternatively spliced messages" refers to mature mRNAs originating from a single gene with 
variations in the number and/or identity of exons, introns and/or intron-exon junctions. 

Chimeric The term chimeric" is used to describe genes, as defined supra, or contmcts 
wherein at least two of the elements of the gene or construct, such as the promoter and the 
coding sequence and/or other regulatory sequences and/or filler sequences and/or complements 
thereof, are heterologous to each other. 

Constitutive Promoter: Promoters referred to herein as "constitutive promoters" actively promote 
transcription under most, but not necessarily all, environmental conditions and states of 
development or cell differentiation. Examples of constitutive promoters include the cauliflower 
mosaic virus (CaMV) 35S transcript initiation region and the 1' or 2' promoter derived from 
T-DNA of Agrobacterium tumefaciens, and other transcription initiation regions from various 
plant genes, such as the maize ubiquitin-1 promoter, known to those of skill. 

Coordinately Expressed: The term coordinately expressed," as used in the current 
invention, refers to genes that are expressed at the same or a similar time and/or stage and/or 
under the same or similar environmental conditions. 
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Domain: Domains are fingerprints or signatures that can be used to characterize 

protein families and/or parts of proteins. Such fingerprints or signatures can comprise 
conserved (1) primary sequence, (2) secondary structure, and/or (3) three-dimensional 
conformation. Generally, each domain has been associated with either a family of proteins or 
motifs. Typically, these families and/or motifs have been correlated with specific in-vitro 
and/or in-vivo activities. A domain can be any length, including the entirety of the sequence 
of a protein. Detailed descriptions of the domains, associated families and motifs, and 
correlated activities of the polypeptides of the instant invention are described below. 
Usually, the polypeptides with designated domain(s) can exhibit at least one activity that is 
exhibited by any polypeptide that comprises the same domain(s). 

Endogenous The term endogenous," within the context of the current invention refers to 
any polynucleotide, polypeptide or protein sequence which is a natural part of a cell or 
organisms regenerated from said cell. 

Exogenous Exogenous," as referred to within, is any polynucleotide, polypeptide 

or protein sequence, whether chimeric or not, that is initially or subsequently introduced into 
the genome of an individual host cell or the organism regenerated from said host cell by any 
means other than by a sexual cross. Examples of means by which this can be accomplished 
are described below, and include Agrobacterium-mtdmttd transformation (of dicots - e.g. 
Salomon et al. EMBOJ, 3:141 (1984); Herrera-Estrella et al. EMBO J, 2:987 (1983); of 
monocots, representative papers are those by Escudero et al.. Plant J, 10:355 (1996), Ishida et 
al.. Nature Biotechnology 14:745 (1996), May et aL, Bio/Technology 13:486 (1995)), biolistic 
methods (Armaleo et aL, Current Genetics 17:97 1990)), electroporation, in planta 
techniques, and the like. Such a plant containing the exogenous nucleic acid is referred to 
here as a To for the primary transgenic plant and Ti for the first generation. The term 
exogenous" as used herein is also intended to encompass inserting a naturally found element 
into a non-naturally found location. 

Filler sequence: As used herein, filler sequence" refers to any nucleotide sequence that 
is inserted into DNA construct to evoke a particular spacing between particular components 
such as a promoter and a coding region and may provide an additional attribute such as a 
restriction enzyme site. 
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Gene: The term gene," as used in the context of the current invention, encompasses all 
regulatory and coding sequence contiguously associated with a single hereditary unit with a 
genetic function (see SCHEMATIC 1). Genes can include non-coding sequences that 
modulate the genetic function that include, but are not limited to, those that specify 
polyadenylation, transcriptional regulation, DNA conformation, chromatin conformation, 
extent and position of base methylation and binding sites of proteins that control all of these. 
Genes comprised of exons" (coding sequences), which may be interrupted by introns" 
(non-coding sequences), encode proteins. A gene's genetic function may require only RNA 
expression or protein production, or may only require binding of proteins and/or nucleic acids 
without associated expression. In certain cases, genes adjacent to one another may share 
sequence in such a way that one gene will overlap the other. A gene can be found within the 
genome of an organism, artificial chromosome, plasmid, vector, etc., or as a separate isolated 
entity. 

Gene Family: Gene family" is used in the current invention to describe a group of 
functionally related genes, each of which encodes a separate protein. 

Heterologous sequences: Heterologous sequences" are those that are not operatively 
linked or are not contiguous to each other in nature. For example, a promoter from com is 
considered heterologous to anArabidopsis coding region sequence. Also, a promoter from a 
gene encoding a growth factor from com is considered heterologous to a sequence encoding the 
com receptor for the growth factor. Regulatory element sequences, such as UTRs or 3' end 
termination sequences that do not originate in nature from the same gene as the coding sequence 
originates from, are considered heterologous to said coding sequence. Elements operatively 
linked in nature and contiguous to each other are not heterologous to each other. On the other 
hand, these same elements remain operatively linked but become heterologous if other filler 
sequence is placed between them. Thus, the promoter and coding sequences of a corn gene 
expressing an amino acid transporter are not heterologous to each other, but the promoter and 
coding sequence of a com gene operatively linked in a novel manner are heterologous. 

Homologous gene In the current invention, homologous gene" refers to a gene that shares 
sequence similarity with the gene of interest. This similarity may be in only a fragment of the 
sequence and often represents a functional domain such as, examples including without 
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limitation a DNA binding domain, a domain with tyrosine kinase activity, or the like. The 
functional activities of homologous genes are not necessarily the same. 

Inducible Promoter An inducible promoter" in the context of the current invention 

refers to a promoter which is regulated under certain conditions, such as light, chemical 
concentration, protein concentration, conditions in an organism, cell, or organelle, etc. A typical 
example of an inducible promoter, which can be utilized with the polynucleotides of the present 
invention, is PARSKl, the promoter from the Arabidopsis gene encoding a serine-threonine 
kmase enzyme, and which promoter is induced by dehydration, abscissic acid and sodium 
chloride (Wang and Goodman, Plant J. 8:37 (1995)) Examples of environmental conditions that 
may affect transcription by inducible promoters include anaerobic conditions, elevated 
temperature, or the presence of light. 

Intergenic region Intergenic region," as used in the current invention, refers to 
nucleotide sequence occurring in the genome that separates adjacent genes. 

Mutant gene In the current invention, mutant" refers to a heritable change in DNA 
sequence at a specific location. Mutants of the current invention may or may not have an 
associated identifiable function when the mutant gene is transcribed. 

Orthologous Gene In the current invention orthologous gene" refers to a second gene that 
encodes a gene product that performs a similar function as the product of a first gene. The 
orthologous gene may also have a degree of sequence similarity to the first gene. The 
orthologous gene may encode a polypeptide that exhibits a degree of sequence similarity to a 
polypeptide corresponding to a first gene. The sequence similarity can be found within a 
functional domain or along the entire length of the coding sequence of the genes and/or their 
corresponding polypeptides. 

Percentage of sequence identity "Percentage of sequence identity," as used herein, is 
determined by comparing two optimally aligned sequences over a comparison window, where 
the fragment of the polynucleotide or amino acid sequence in the comparison window may 
comprise additions or deletions (e.g., gaps or overhangs) as compared to the reference sequence 
(which does not comprise additions or deletions) for optimal alignment of the two sequences. 
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The percentage is calculated by determining the number of positions at which the identical 
nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of positions in the 
window of comparison and multiplying the result by 100 to yield the percentage of sequence 
identity. Optimal alignment of sequences for comparison may be conducted by the local 
homology algorithm of Smith and Waterman A Jrf^APL. MatK 2:482 (1981), by the homology 
alignment algorithm of Needleman and Wunsch J. Mol Biol 48:443 (1970), by the search for 
similarity method of Pearson and Lipman Proc. Natl Acad, ScL (USA) 85: 2444 (1988), by 
computerized implementations of these algorithms (GAP, BESTFIT, BLAST, PASTA, and 
TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 
Science Dr., Madison, WI), or by inspection. Given that two sequences have been identified for 
comparison, GAP and BESTFIT are preferably employed to determine their optimal alignment. 
Typically, the default values of 5.00 for gap weight and 0.30 for gap weight length are used. 
The term "substantial sequence identity" between polynucleotide or polypeptide sequences 
refers to polynucleotide or polypeptide comprising a sequence that has at least 80% sequence 
identity, preferably at least 85%, more preferably at least 90% and most preferably at least 95%, 
even more preferably, at least 96%, 97%, 98% or 99% sequence identity compared to a 
reference sequence using the programs. 

Plant Promoter A plant promoter" is a promoter capable of initiating transcription in 

plant cells and can drive or facilitate transcription of a fragment of the SDF of the instant 
invention or a coding sequence of the SDF of the instant invention. Such promoters need not 
be of plant origin. For example, promoters derived from plant viruses, such as the CaMV35S 
promoter or from Agrobacterium tumefaciens such as the T-DNA promoters, can be plant 
promoters. A typical example of a plant promoter of plant origin is the maize ubiquitin-1 (ubi- 
l)promoter known to those of skill. 



Promoter: The term "promoter," as used herein, refers to a region of sequence 

determinants located upstream from the start of transcription of a gene and which are involved in 
recognition and binding of RNA polymerase and other proteins to initiate and modulate 
transcription. A basal promoter is the minimal sequence necessary for assembly of a 
transcription complex required for transcription initiation. Basal promoters frequently include a 
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TATA box" element usually located between 15 and 35 nucleotides upstream from the site of 
initiation of transcription. Basal promoters also sometimes include a CCAAT box" element 
(typically a sequence CCAAT) and/or a GGGCG sequence, usually located between 40 and 200 
nucleotides, preferably 60 to 120 nucleotides, upstream from the start site of transcription. 

Public sequence: The term public sequence as used in the context of the instant 
application, refers to any sequence that has been deposited in a publicly accessible database. 
This term encompasses both amino acid and nucleotide sequences. Such sequences are 
publicly accessible, for example, on the BLAST databases on the NCBI FTP web site 
(accessible at ncbi.nlm.gov/blast). The database at the NCBI GTP site utilizes gi" numbers 
assigned by NCBI as a unique identifier for each sequence in the databases, thereby 
providing a non-redundant database for sequence from various databases, including 
GenBank, EMBL, DBBJ, (DNA Database of Japan) and PDB (Brookhaven Protein Data 
Bank). 

Regulatory Sequence The term regulatory sequence," as used in the current 

invention, refers to any nucleotide sequence that influences transcription or translation 
initiation and rate, and stability and/or mobility of the transcript or polypeptide product. 
Regulatory sequences include, but are not limited to, promoters, promoter control elements, 
protein binding sequences, 5' and 3' UTRs, transcriptional start site, termination sequence, 
polyadenylation sequence, introns, certain sequences within a coding sequence, etc. 

Related Sequences: Related sequences" refer to either a polypeptide or a nucleotide 
sequence that exhibits some degree of sequence similarity with a sequence described in Table 
1. 

Scaffold Attachment Region (SAR) As used herein, scaffold attachment region" is a DNA 
sequence that anchors chromatin to the nuclear matrix or scaffold to generate loop domains 
that can have either a transcriptionally active or inactive structure (Spiker and Thompson 
(1996) Plant Physiol. 110: 15-21). 

Sequence-determined DNA fragments (SDFs) Sequence-determined DNA fragments" 

as used in the current invention are isolated sequences of genes, fragments of genes. 
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intergenic regions or contiguous DNA from plant genomic DNA or cDNA or RNA the 
sequence of which has been determined. 

Signal Peptide A signal peptide" as used in the current invention is an amino acid 

sequence that targets the protein for secretion, for transport to an intracellular compartment or 
organelle or for incorporation into a membrane. Signal peptides are indicated in the tables 
and a more detailed description located below. 

Specific Promoter In the context of the current invention, specific promoters" refers to a 
subset of inducible promoters that have a high preference for being induced in a specific 
tissue or cell and/or at a specific time during development of an organism. By high 
preference" is meant at least 3-fold, preferably 5-fold, more preferably at least 10-fold still 
more preferably at least 20-fold, 50-fold or 100-fold increase in transcription in the desired 
tissue over the transcription in any other tissue. Typical examples of temporal and/or tissue 
specific promoters of plant origin that can be used with the polynucleotides of the present 
invention, are: PTA29, a promoter which is capable of driving gene transcription specifically in 
tapetum and only during anther development (Koltonow et aL, Plant Cell 2:1201 (1990); RCc2 
and RCc3, promoters that direct root-specific gene transcription in rice (Xu et al., Plant MoL 
Biol 27:237 (1995); TobRB27, a root-specific promoter from tobacco (Yamamoto et al., Plant 
Cell 3:371 (1991)). Examples of tissue-specific promoters under developmental control include 
promoters that initiate transcription only in certain tissues or organs, such as root, ovule, fruit, 
seeds, or flowers. Other suitable promoters include those from genes encoding storage proteins 
or the lipid body membrane protein, oleosin. A few root-specific promoters are noted above. 

Stringency "Stringency" as used herein is a function of probe length, probe composition (G 
+ C content), and salt concentration, organic solvent concentration, and temperature of 
hybridization or wash conditions. Stringency is typically compared by the parameter T^, which 
is the temperature at which 50% of the complementary molecules in the hybridization are 
hybridized, in terms of a temperature differential from T^. High stringency conditions are those 
providing a condition of - 5^C to T^n - lO^'C. Medium or moderate stringency conditions are 
those providing Tm - 20''C to - 29''C. Low stringency conditions are those providing a 
condition of T^, - 40''C to - 48^C. The relationship of hybridization conditions to (in ^C) is 
expressed in the mathematical equation 
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= 81.5 -16.6(logio[Na^]) + 0.41(%G+C) - (600/N) (1) 

where N is the length of the probe. This equation works well for probes 14 to 70 nucleotides in 
length that are identical to the target sequence. The equation below for T^ of DNA-DNA 
hybrids is useful for probes in the range of 50 to greater than 500 nucleotides, and for conditions 
5 that include an organic solvent (formamide). 

Tn, = 81.5+16.6 log {[Na^]/(l+0.7[Na^])}+ 0.41(%G+C)-500/L 0.63(%formamide) (2) 

where L is the length of the probe in the hybrid. (P. Tijessen, Hybridization with Nucleic 
Acid Probes" in Laboratory Techniques in Biochemistry and Molecular Biolog y, P.C. vand 
der Vliet, ed., c. 1993 by Elsevier, Amsterdam.) The T^ of equation (2) is affected by the 
1 0 nature of the hybrid; for DNA-RNA hybrids T^, is 10-15^C higher than calculated, for RNA- 
RNA hybrids T^n is 20-25T higher. Because the T^ decreases about 1 for each 1% 
decrease in homology when a long probe is used (Bonner et al., J. MoL Biol. 81:123 (1973)), 
stringency conditions can be adjusted to favor detection of identical genes or related family 
members. 

1 5 Equation (2) is derived assuming equilibrium and therefore, hybridizations according 

to the present invention are most preferably performed under conditions of probe excess and 
for sufficient time to achieve equilibrium. The time required to reach equilibrium can be 
shortened by inclusion of a hybridization accelerator such as dextran sulfate or another high 
volume polymer in the hybridization buffer. 

2 0 Stringency can be controlled during the hybridization reaction or after hybridization 

has occurred by altering the salt and temperature conditions of the wash solutions used. The 
formulas shown above are equally valid when used to compute the stringency of a wash 
solution. Preferred wash solution stringencies lie within the ranges stated above; high 
stringency is 5-8''C below T^, medium or moderate stringency is 26-29''C below T^ and low 

2 5 stringency is 45-48°C below Tn,. 

Substantially free of A composition containing A is substantially free of B when 

at least 85% by weight of the total A+B in the composition is A. Preferably, A comprises at 
least about 90% by weight of the total of A+B in the composition, more preferably at least 

3 0 about 95% or even 99% by weight. For example, a plant gene or DNA sequence can be 

considered substantially free of other plant genes or DNA sequences. 
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Translational start site In the context of the current invention, a translational start 

site" is usually an ATG in the cDNA transcript, more usually the first ATG, A single cDNA, 
however, may have multiple translational start sites. 

Transcription start site Transcription start site" is used in the current invention to 

describe the point at vi^hich transcription is initiated. This point is typically located about 25 
nucleotides downstream from a TFIID binding site, such as a TATA box. Transcription can 
initiate at one or more sites within the gene, and a single gene may have multiple transcriptional 
start sites, some of which may be specific for transcription in a particular cell-type or tissue. 

Untranslated region (UTR) A UTR" is any contiguous series of nucleotide bases that is 
transcribed, but is not translated. These untranslated regions may be associated with 
particular functions such as increasing mRNA message stability. Examples of UTRs include, 
but are not limited to polyadenylation signals, terminations sequences, sequences located 
between the transcriptional start site and the first exon (5' UTR) and sequences located 
between the last exon and the end of the mRNA (3' UTR). 

Variant: The term variant" is used herein to denote a polypeptide or protein or 
polynucleotide molecule that differs from others of its kind in some way. For example, 
polypeptide and protein variants can consist of changes in amino acid sequence and/or charge 
and/or post-translational modifications (such as glycosylation, etc). 



DETAILED DESCRIPTION OF THE INVENTION 

I. Polynucleotides 

Exemplified SDFs of the invention represent fragments of the genome of com, wheat, 
rice, soybean ox Arabidopsis and/or represent mRNA expressed from that genome. The isolated 
nucleic acid of the invention also encompasses corresponding fragments of the genome and/or 
cDNA complement of other organisms as described in detail below. 
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Polynucleotides of the invention can be isolated from polynucleotide libraries using 
primers comprising sequence similar to those described by Table 1, See, for example, the 
methods described in Sambrook et al., supra. 

Alternatively, the polynucleotides of the invention can be produced by chemical 
synthesis. Such synthesis methods are described below. 

It is contemplated that the nucleotide sequences presented herein may contain some 
small percentage of errors. These errors may arise in the normal course of determination of 
nucleotide sequences. Sequence errors can be corrected by obtaining seeds deposited under the 
accession numbers cited herein, propagating them, isolating genomic DNA or appropriate 
mRNA from the resulting plants or seeds thereof, amplifying the relevant fragment of the 
genomic DNA or mRNA using primers having a sequence that flanks the erroneous sequence, 
and sequencing the amplification product. 

LA. Probes. Primers and Substrates 

SDFs of the invention can be applied to substrates for use in array applications such 
as, but not limited to, assays of global gene expression, for example under varying conditions 
of development, growth conditions. The arrays can also be used in diagnostic or forensic 
methods (WO95/35505, US 5,445,943 and US 5,410,270). 

Probes and primers of the instant invention will hybridize to a polynucleotide 
comprising a sequence in Table 1. Though many different nucleotide sequences can encode 
an amino acid sequence, the sequences of Table 1 are generally preferred for encoding 
polypeptides of the invention. However, the sequence of the probes and/or primers of the 
instant invention need not be identical to those in Table 1 or the complements thereof. For 
example, some variation in probe or primer sequence and/or length can allow additional 
family members to be detected, as well as orthologous genes and more taxonomically distant 
related sequences. Similariy, probes and/or primers of the invention can include additional 
nucleotides that serve as a label for detecting the formed duplex or for subsequent cloning 
purposes. 

Probe length will vary depending on the application. For use as primers, probes are 
12-40 nucleotides, preferably 18-30 nucleotides long. For use in mapping, probes are 
preferably 50 to 500 nucleotides, preferably 100-250 nucleotides long. For Southern 
hybridizations, probes as long as several kilobases can be used as explained below. 

The probes and/or primers can be produced by synthetic procedures such as the 
triester method of Matteucci et al. /, Am. Chem. Soc, 103:3185( 1981); or according to Urdea 
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et aL Proc. Natl Acad, 80:7461 (1981) or using commercially available automated 
oligonucleotide synthesizers. 



I.B. Methods of Detection and Isolation 

The polynucleotides of the invention can be utilized in a number of methods known to 
those skilled in the art as probes and/or primers to isolate and detect polynucleotides, 
including, without limitation: Southerns, Northerns, Branched DNA hybridization assays, 
polymerase chain reaction, and microarray assays, and variations thereof. Specific methods 
given by way of examples, and discussed below include: 

Hybridization 

Methods of Mapping 

Southern Blotting 

Isolating cDNA from Related Organisms 

Isolating and/or Identifying Orthologous Genes. 
Also, the nucleic acid molecules of the invention can used in other methods, such as high 
density oligonucleotide hybridizing assays, described, for example, in U.S. Pat. Nos. 
6,004,753; 5,945,306; 5,945,287; 5,945,308; 5,919,686; 5,919,661; 5,919,627; 5,874,248; 
5,871,973; 5,871,971; and 5,871,930; and PCT Pub. Nos. WO 9946380; WO 9933981; WO 
9933870; WO 9931252; WO 9915658; WO 9906572; WO 9858052; WO 9958672; and WO 
9810858. 

B.l. Hybridization 
The isolated SDFs of Table 1 of the present invention can be used as probes and/or 
primers for detection and/or isolation of related polynucleotide sequences through 
hybridization. Hybridization of one nucleic acid to another constitutes a physical property 
that defines the subject SDF of the invention and the identified related sequences. Also, such 
hybridization imposes structural limitations on the pair. A good general discussion of the 
factors for determining hybridization conditions is provided by Sambrook et al. ("Molecular 
Cloning, a Laboratory Manual, 2nd ed,, c. 1989 by Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY; see esp., chapters 11 and 12). Additional considerations and details of the 
physical chemistry of hybridization are provided by G.H, Keller and M.M. Manak DNA 
Probes", 2^^^ Ed. pp. 1-25, c. 1993 by Stockton Press, New York, NY. 



^ Attorney No. 2750-1300P 

19 

Depending on the stringency of the conditions under which these probes and/or primers 
are used, polynucleotides exhibiting a wide range of similarity to those in Table 1 can be 
detected or isolated. When the practitioner wishes to examine the result of membrane 
hybridizations under a variety of stringencies, an efficient way to do so is to perform the 
hybridization under a low stringency condition, then to wash the hybridization membrane 
under increasingly stringent conditions. 

When using SDFs to identify orthologous genes in other species, the practitioner will 
preferably adjust the amount of target DNA of each species so that, as nearly as is practical, 
the same number of genome equivalents are present for each species examined. This 
prevents faint signals from species having large genomes, and thus small numbers of genome 
equivalents per mass of DNA, from erroneously being interpreted as absence of the 
corresponding gene in the genome. 

The probes and/or primers of the instant invention can also be used to detect or isolate 
nucleotides that are identical" to the probes or primers. Two nucleic acid sequences or 
polypeptides are said to be "identical" if the sequence of nucleotides or amino acid residues, 
respectively, in the two sequences is the same when aligned for maximum correspondence as 
described below. 

Isolated polynucleotides within the scope of the invention also include allelic variants of 
the specific sequences presented in Table 1, The probes and/or primers of the invention can also 
be used to detect and/or isolate polynucleotides exhibiting at least 80% sequence identity with 
the sequences of Table 1 or fragments thereof. 

With respect to nucleotide sequences, degeneracy of the genetic code provides the 
possibility to substitute at least one base of the base sequence of a gene with a different base 
without causing the amino acid sequence of the polypeptide produced from the gene to be 
changed. Hence, the DNA of the present invention may also have any base sequence that has 
been changed from a sequence in Table 1 by substitution in accordance with degeneracy of 
genetic code. References describing codon usage include: Carels et al, J. Mol Evol 46: 45 
(1998) and Fennoy et al, Nucl Acids Res. 21(23) : 5294 (1993). 

B.2. Mapping 

The isolated SDF DNA of the invention can be used to create various types of genetic 
and physical maps of the genome of corn, Arabidopsis, soybean, rice, wheat, or other plants. 
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Some SDFs may be absolutely associated with particular phenotypic traits, allowing 
construction of gross genetic maps. While not all SDFs will immediately be associated with 
a phenotype, all SDFs can be used as probes for identifying polymorphisms associated with 
phenotypes of interest. Briefly, one method of mapping involves total DNA isolation from 
individuals. It is subsequently cleaved with one or more restriction enzymes, separated 
according to mass, transferred to a solid support, hybridized with SDF DNA and the pattern 
of fragments compared. Polymorphisms associated with a particular SDF are visualized as 
differences in the size of fragments produced between individual DNA samples after 
digestion with a particular restriction enzyme and hybridization with the SDF. After 
identification of polymorphic SDF sequences, linkage studies can be conducted. By using the 
individuals showing polymorphisms as parents in crossing programs, F2 progeny 
recombinants or recombinant inbreds, for example, are then analyzed. The order of DNA 
polymorphisms along the chromosomes can be determined based on the frequency with 
which they are inherited together versus independently. The closer two polymorphisms are 
together in a chromosome the higher the probability that they are inherited together. 
Integration of the relative positions of all the polymorphisms and associated marker SDFs can 
produce a genetic map of the species, where the distances between markers reflect the 
recombination frequencies in that chromosome segment. 

The use of recombinant inbred lines for such genetic mapping is described for 
Arabidopsis by Alonso-Blanco et al. {Methods in Molecular Biology, vol.82, Arabidopsis 
Protocols", pp. 137-146, J.M. Martinez-Zapater and J. Salinas, eds., c. 1998 by Humana 
Press, Totowa, NJ) and for com by Burr ( Mapping Genes with Recombinant Inbreds", pp. 
249-254. In Freeling, M. and V. Walbot (Ed.), The Maize Handbook, c. 1994 by Springer- 
Verlag New York, Inc.: New York, NY, USA; Berlin Germany; Burr et al. Genetics (1998) 
118: 519; Gardiner, J. et al., (1993) Genetics 134: 917). This procedure, however, is not 
limited to plants and can be used for other organisms (such as yeast) or for individual cells. 

The SDFs of the present invention can also be used for simple sequence repeat (SSR) 
mapping. Rice SSR mapping is described by Morgante et al. {The Plant Journal (1993) 3: 
165), Panaud et al. {Genome (1995) 38: 1170); Senior et al. {Crop Science (1996) 36: 1676), 
Taramino et al. {Genome (1996) 39: 277) and Ahn et al. {Molecular and General Genetics 
(1993) 241: 483-90). SSR mapping can be achieved using various methods. In one instance, 
polymorphisms are identified when sequence specific probes contained within an SDF 
flanking an SSR are made and used in polymerase chain reaction (PGR) assays with template 
DNA from two or more individuals of interest. Here, a change in the number of tandem 



.Attorney No. 2750-1300P 

21 

repeats between the SSR-flanking sequences produces differently sized fragments (U.S. 
Patent 5,766,847). Alternatively, polymorphisms can be identified by using the PGR 
fragment produced from the SSR-flanking sequence specific primer reaction as a probe 
against Southern blots representing different individuals (U.H. Refseth et al., (1997) 
Electrophoresis 18: 1519). 

Genetic and physical maps of crop species have many uses. For example, these maps 
can be used to devise positional cloning strategies for isolating novel genes from the mapped 
crop species. In addition, because the genomes of closely related species are largely syntenic 
(that is, they display the same ordering of genes within the genome), these maps can be used 
to isolate novel alleles from relatives of crop species by positional cloning strategies. 

The various types of maps discussed above can be used with the SDFs of the 
invention to identify Quantitative Trait Loci (QTLs). Many important crop traits, such as the 
solids content of tomatoes, are quantitative traits and result from the combined interactions of 
several genes. These genes reside at different loci in the genome, oftentimes on different 
chromosomes, and generally exhibit multiple alleles at each locus. The SDFs of the 
invention can be used to identify QTLs and isolate specific alleles as described by de Vicente 
and Tanksley {Genetics 134:585 (1993)). In addition to isolating QTL alleles in present crop 
species, the SDFs of the invention can also be used to isolate alleles from the corresponding 
QTL of wild relatives. Transgenic plants having various combinations of QTL alleles can 
then be created and the effects of the combinations measured. Once a desired allele 
combination has been identified, crop improvement can be accomplished either through 
biotechnological means or by directed conventional breeding programs (for review see 
Tanksley and McGouch, Science 277:1063 (1997)). 

In another embodiment, the SDFs can be used to help create physical maps of the 
genome of com, Arabidopsis and related species. Where SDFs have been ordered on a 
genetic map, as described above, they can be used as probes to discover which clones in 
large libraries of plant DNA fragments in YAGs, BAGs, etc. contain the same SDF or similar 
sequences, thereby facilitating the assignment of the large DNA fragments to chromosomal 
positions. Subsequently, the large BAGs, YAGs, etc. can be ordered unambiguously by more 
detailed studies of their sequence composition (e.g. Marra et al. (1997) Genomic Research 
7:1072-1084) and by using their end or other sequences to find the identical sequences in 
other cloned DNA fragments. The overlapping of DNA sequences in this way allows large 
contigs of plant sequences to be built that, when sufficiently extended, provide a complete 
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physical map of a chromosome. Sometimes the SDFs themselves will provide the means of 
joining cloned sequences into a contig. 

The patent publication WO95/35505 and U.S. Patents 5,445,943 and 5,410,270 
describe scanning multiple alleles of a plurality of loci using hybridization to arrays of 
oligonucleotides. These techniques are useful for each of the types of mapping discussed 
above. 

Following the procedures described above and using a plurality of the SDFs of 
the present invention, any individual can be genotyped. These individual genotypes can be 
used for the identification of particular cultivars, varieties, lines, ecotypes and genetically 
modified plants or can serve as tools for subsequent genetic studies involving multiple 
phenotypic traits. 

B.3 Southern Blot Hybridization 

The sequences from Table 1 can be used as probes for various hybridization 
techniques. These techniques are useful for detecting target polynucleotides in a sample or 
for determining whether transgenic plants, seeds or host cells harbor a gene or sequence of 
interest and thus might be expected to exhibit a particular trait or phenotype. 

In addition, the SDFs from the invention can be used to isolate additional members of 
gene families from the same or different species and/or orthologous genes from the same or 
different species. This is accomplished by hybridizing an SDF to, for example, a Southern 
blot containing the appropriate genomic DNA or cDNA. Given the resulting hybridization 
data, one of ordinary skill in the art could distinguish and isolate the correct DNA fragments 
by size, restriction sites, sequence and stated hybridization conditions from a gel or from a 
library. 

Identification and isolation of orthologous genes from closely related species and 
alleles within a species is particularly desirable because of their potential for crop 
improvement. Many important crop traits, such as the solid content of tomatoes, result from 
the combined interactions of the products of several genes residing at different loci in the 
genome. Generally, alleles at each of these loci can make quantitative differences to the trait. 
By identifying and isolating numerous alleles for each locus from within or different species, 
transgenic plants with various combinations of alleles can be created and the effects of the 
combinations measured. Once a more favorable allele combination has been identified, crop 
improvement can be accomplished either through biotechnological means or by directed 
conventional breeding programs (Tanksley et al. Science 277:1063(1997)). 
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The results from hybridizations of the SDFs of the invention to, for example. 
Southern blots containing DNA from another species can also be used to generate restriction 
fragment maps for the corresponding genomic regions. These maps provide additional 
information about the relative positions of restriction sites within fragments, further 
distinguishing mapped DNA from the remainder of the genome. 

Physical maps can be made by digesting genomic DNA with different combinations 
of restriction enzymes. 

Probes for Southern blotting to distinguish individual restriction fragments can range 
in size from 15 to 20 nucleotides to several thousand nucleotides. More preferably, the probe 
is 100 to 1,000 nucleotides long for identifying members of a gene family when it is found 
that repetitive sequences would complicate the hybridization. For identifying an entire 
corresponding gene in another species, the probe is more preferably the length of the gene, 
typically 2,000 to 10,000 nucleotides, but probes 50-1,000 nucleotides long might be used. 
Some genes, however, might require probes up to 1,500 nucleotides long or overlapping 
probes constituting the full-length sequence to span their lengths. 

Also, while it is preferred that the probe be homogeneous with respect to its sequence, 
it is not necessary. For example, as described below, a probe representing members of a gene 
family having diverse sequences can be generated using PGR to amplify genomic DNA or 
RNA templates using primers derived from SDFs that include sequences that define the gene 
family. 

For identifying corresponding genes in another species, the next most preferable 
probe is a cDNA spanning the entire coding sequence, which allows all of the mRNA-coding 
fragment of the gene to be identified. Probes for Southem blotting can easily be generated 
from SDFs by making primers having the sequence at the ends of the SDF and using corn or 
Arabidopsis genomic DNA as a template. In instances where the SDF includes sequence 
conserved among species, primers including the conserved sequence can be used for PGR 
with genomic DNA from a species of interest to obtain a probe. 

Similarly, if the SDF includes a domain of interest, that fragment of the SDF can be used to 
make primers and, with appropriate template DNA, used to make a probe to identify genes 
containing the domain. Alternatively, the PGR products can be resolved, for example by gel 
electrophoresis, and cloned and/or sequenced. Using Southern hybridization, the variants of 
the domain among members of a gene family, both within and across species, can be 
examined. 
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B.4.1 Isolating DNA from Related Org an isms 

The SDFs of the invention can be used to isolate the corresponding DNA from other 
organisms. Either cDNA or genomic DNA can be isolated. For isolating genomic DNA, a 
lambda, cosmid, BAG or YAC, or other large insert genomic library from the plant of interest 
can be constructed using standard molecular biology techniques as described in detail by 
Sambrook et aL 1989 (Molecular Cloning: A Laboratory Manual, 2""* ed. Cold Spring Harbor 
Laboratory Press, New York) and by Ausubel et al. 1992 (Current Protocols in Molecular 
Biology, Greene Publishing, New York). 

To screen a phage library, for example, recombinant lambda clones are plated out on 
appropriate bacterial medium using an appropriate E, coli host strain. The resulting plaques 
are lifted from the plates using nylon or nitrocellulose filters. The plaque lifts are processed 
through denaturation, neutralization, and washing treatments following the standard protocols 
outlined by Ausubel et aL (1992). The plaque lifts are hybridized to either radioactively 
labeled or non-radioactively labeled SDF DNA at room temperature for about 16 hours, 
usually in the presence of 50% formamide and 5X SSG (sodium chloride and sodium citrate) 
buffer and blocking reagents. The plaque lifts are then washed at 42''G with 1% Sodium 
Dodecyl Sulfate (SDS) and at a particular concentration of SSG. The SSG concentration used 
is dependent upon the stringency at which hybridization occurred in the initial Southern blot 
analysis performed. For example, if a fragment hybridized under medium stringency (e.g., 
Tm - 20''G), then this condition is maintained or preferably adjusted to a less stringent 
condition (e.g., Tm-30°C) to wash the plaque lifts. Positive clones show detectable 
hybridization e.g., by exposure to X-ray films or chromogen formation. The positive clones 
are then subsequently isolated for purification using the same general protocol outlined 
above. Once the clone is purified, restriction analysis can be conducted to narrow the region 
corresponding to the gene of interest. The restriction analysis and succeeding subcloning 
steps can be done using procedures described by, for example Sambrook et al. (1989) cited 
above. 

The procedures outlined for the lambda library are essentially similar to those used for 
YAG library screening, except that the YAG clones are harbored in bacterial colonies. The 
YAG clones are plated out at reasonable density on nitrocellulose or nylon filters supported 
by appropriate bacterial medium in petri plates. Following the growth of the bacterial clones, 
the filters are processed through the denaturation, neutralization, and washing steps following 
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the procedures of Ausubel et al. 1992. The same hybridization procedures for lambda library 
screening are followed. 

To isolate cDNA, similar procedures using appropriately modified vectors are 
employed. For instance, the library can be constructed in a lambda vector appropriate for 
cloning cDNA such as Xgtll. Alternatively, the cDNA library can be made in a plasmid 
vector. cDNA for cloning can be prepared by any of the methods known in the art, but is 
preferably prepared as described above. Preferably, a cDNA library will include a high 
proportion of full-length clones. 

5. Isolating and/or Identifying Orthologous Genes 
Probes and primers of the invention can be used to identify and/or isolate 
polynucleotides related to those in Table 1. Related polynucleotides are those that are native to 
other plant organisms and exhibit either similar sequence or encode polypeptides with similar 
biological activity. One specific example is an orthologous gene. Orthologous genes have the 
same functional activity. As such, orthologous genes may be distinguished from homologous 
genes. The percentage of identity is a function of evolutionary separation and, in closely related 
species, the percentage of identity can be 98 to 100%. The amino acid sequence of a protein 
encoded by an orthologous gene can be less than 75% identical, but tends to be at least75% or at 
least 80% identical, more preferably at least 90%, most preferably at least 95% identical to the 
amino acid sequence of the reference protein. 

To find orthologous genes, the probes are hybridized to nucleic acids from a species of interest 
under low stringency conditions, preferably one where sequences containing as much as 40-45% 
mismatches will be able to hybridize. This condition is established by T^n - 40''C to Tm - 48''C 
(see below). Blots are then washed under conditions of increasing stringency. It is preferable 
that the wash stringency be such that sequences that are 85 to 100% identical will hybridize. 
More preferably, sequences 90 to 100% identical will hybridize and most preferably only 
sequences greater than 95% identical will hybridize. One of ordinary skill in the art will 
recognize that, due to degeneracy in the genetic code, amino acid sequences that are identical 
can be encoded by DNA sequences as little as 67% identical or less. Thus, it is preferable, for 
example, to make an overlapping series of shorter probes, on the order of 24 to 45 nucleotides, 
and individually hybridize them to the same arrayed library to avoid the problem of degeneracy 
introducing large numbers of mismatches. 
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As evolutionary divergence increases, genome sequences also tend to diverge. Thus, 
one of skill will recognize that searches for orthologous genes between more divergent 
species will require the use of lower stringency conditions compared to searches between 
closely related species. Also, degeneracy of the genetic code is more of a problem for 
searches in the genome of a species more distant evolutionarily from the species that is the 
source of the SDF probe sequences. 

Therefore the method described in Bouckaert et al., U.S. Ser. No. 60/121,700 Atty. 
Dkt. No. 2750-117P, Client Dkt. No. 00010.001, filed February 25, 1999, hereby 
incorporated in its entirety by reference, can be applied to the SDFs of the present invention 
to isolate related genes from plant species which do not hybridize to the com Arabidopsis, 
soybean, rice, wheat, and other plant sequences of Table 1. 

Identification of the relationship of nucleotide or amino acid sequences among plant 
species can be done by comparing the nucleotide or amino acid sequences of SDFs of the 
present application with nucleotide or amino acid sequences of other SDFs such as those 
present in applications listed in the table below: 



Country 1 Attorney No. 



United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 



Client No. 



2750-0301 P 

2750-0300P 

2750-0302P 

2750-0304P 

2750-0305P 

2750-0306P 

2750-0307P 

2750-0308P 

2750-0309P 

2750-031 1 P 

2750-031 OP 

2750-031 2P 

2750-031 3P 

2750-031 4P 

2750-031 5P 

2750-031 6P 

2750-031 7P 

2750-031 8P 

2750-031 9P 

2750-0320P 

2750-0321 P 

2750-0322P 

2750-0323P 

2750-0324P 

2750-0325P 

2750-0326P 

2750-0327P 



Filed i ^piicatlonr Ha. 



80002.001 

80001 .001 

80003.001 

80004.001 

80005.001 

80006.001 

80007.001 

80008.001 

80009.001 

80011.001 

80010.001 

80012.001 

80013.001 

80014.001 

80015.001 

80016.001 

80017.001 

80018.001 

80019.001 

80020.001 

80021 .001 

80022.001 

80023.001 

80024.001 

80025.001 

80026.001 

80027.001 



9/4/98 
9/4/98 
9/1 1 /98 
9/1 7/98 
9/1 8/98 
9/21/98 
9/24/98 
9/30/98 
9/30/98 
1 0/5/98 
1 0/5/98 
1 0/6/98 
1 0/8/98 
1 0/9/98 
1 0/1 3/98 
1 0/1 4/98 
1 0/1 6/98 
10/19/98 
1 0/20/98 
1 0/21 /98 
1 0/22/98 
1 0/28/98 
1 0/27/98 
1 0/29/98 
1 0/30/98 
11/2/98 
11/6/98 



60/099,672 

60/099,671 

60/099,933 

60/1 00,864 

60/101,042 

60/101,255 

60/1 01 ,682 

60/1 02,533 

60/102,460 

60/103,141 

60/103,116 

60/103,215 

60/1 03,554 

60/103,574 

60/103,907 

60/1 04,268 

60/104,680 

60/1 04,828 

60/1 05,008 

60/1 05,142 

60/105,533 

60/105,571 

60/1 05,81 5 

60/106,105 

60/106,218 

60/1 06,685 

60/1 07,282 
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United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 



t Attorney No. 

2750-0329P 

2750-0328 P 

2750-0330P 

2750-0331 P 

2750-0332P 

2750-0333P 

2750-0334P 

2750-0335P 

2750-0336P 

2750-0337P 

2750-0339P 

2750-0338P 

2750-0340 P 

2750-0341 P 

2750-0342P 

2750-0343P 

2750-0344P 

2750-0345P 

2750-0346P 

2750-0347P 

2750-0348P 

2750-0349P 

2750-0350P 

2750-0351 P 

2750-0352P 

2750-0353P 

2750-0354P 

2750-0355P 

2750-0356P 

2750-0357P 

2750-0358P 

2750-0359P 

2750-0360P 

2750-0361 P 

2750-0367P 

2750-0364P 

2750-0363P 

2750-0362P 

2750-0366P 

2750-0365P 

2750-0370P 

2750-0369P 

2750-0368P 

2750-0371 P 

2750-0372P 

2750-0373P 

2750-0374P 

2750-0376P 

2750-0375P 

2750-0377P 



Of lent Ho. 



80029.001 

80028.001 

80030.001 

80031 .001 

80032.001 

80033.001 

80034.001 

80035.001 

80036.001 

80037.001 

80039.001 

80038.001 

80040.001 

80041 .001 

80042.001 

80043.001 

80044.001 

80045.001 

80046.001 

80047.001 

80048.001 

80049.001 

80050.001 

80051 .001 

80052.001 

80053.001 

80054.001 

80055.001 

80056.001 

80057.001 

80058.001 

80059.001 

80060.001 

80061 .001 

80067.001 

80064.001 

80063.001 

80062.001 

80066.001 

80065.001 

80070.001 

80069.001 

80068.001 

80071 .001 

80072.001 

80073.001 

80074.001 

80076.001 

80075.001 

80077.001 



galled { AppHeatlOft No. 



11/9/98 
11/9/98 
11/10/98 
11/12/98 
11/16/98 
11/17/98 
11/19/98 
11/19/98 
11/20/98 
11/23/98 
11/25/98 
11/25/98 
11/27/98 
11/30/98 
12/1/98 
1 2/2/98 
1 2/3/98 
12/7/98 
1 2/9/98 
12/10/98 
12/11/98 
1 2/1 4/98 
1 2/1 5/98 
12/16/98 
12/17/98 
12/18/98 
12/21/98 
1 2/22/98 
1 2/23/98 
1 2/28/98 
1 2/29/98 
1 2/30/98 
1/4/99 
1/6/99 
1/7/99 
1/7/99 
1/7/99 
1/7/99 
1/7/99 
1/7/99 
1/8/99 
1/8/99 
1/8/99 
1/11/99 
1/12/99 
1/13/99 
1/14/99 
1/15/99 
1/15/99 
1 /1 9/99 



60/107,719 
60/1 07,720 
60/1 07,836 
60/108,190 
60/1 08,526 
60/1 08,901 
60/109,124 
60/109,127 
60/1 09,267 
60/1 09,594 
60/1 1 0,050 
60/1 1 0,053 
60/110,158 
60/1 1 0,263 
60/1 1 0,495 
60/1 1 0,626 
60/1 1 0,701 
60/1 1 1 ,339 
60/1 1 1 ,589 
60/1 1 1 ,782 
60/111,812 
60/1 1 2,096 
60/1 1 2,224 
60/112,624 
60/112,862 
60/112,912 
60/113,248 
60/1 1 3,522 
60/1 1 3,826 
60/113,998 
60/1 1 4,384 
60/114,455 
60/114,740 
60/114,866 
60/115,154 
60/115,151 
60/115,152 
60/115,153 
60/115,156 
60/115,155 
60/115,293 
60/115,365 
60/115,364 
60/1 1 5,339 
60/115,518 
60/115,847 
60/1 1 5,905 
60/1 1 6,384 
60/1 1 6,383 
60/1 1 6,329 



.Attorney No. 2750-1300P 



Coyntrv 



United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
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Ciieiit No. 



2750-0378P 

2750-0379P 

2750-0380P 

2750-0382P 

2750-0381 P 

2750-0383P 

2750-0384P 

2750-G385P 

2750-0386P 

2750-0387P 

2750-0388P 

2750-0389P 

2750-0391 P 

2750-0393P 

2750-0392P 

2750-0394P 

2750-0395P 

2750-0396P 

2750-0397P 

2750-0390P 

2750-0398P 

2750-0399P 

2750-0400P 

2750-0401 P 

2750-0402P 

2750-0403P 

2750-0405P 

2750-0404P 

2750-0406P 

2750-0407P 

2750-0409P 

2750-0408P 

2750-041 OP 

2750-041 1 P 

2750-041 2P 

2750-041 3P 

2750-041 4P 

2750-041 5P 

2750-041 6P 

2750-041 7P 

2750-041 8P 

2750-0420P 

2750-0421 P 

2750-0303P 

2750-0422P 

2750-0423P 

2750-0424P 

2750-0425P 

2750-0426P 

2750-0427P 



80078.001 

80079.001 

80080.001 

80082.001 

80081 .001 

80083.001 

80084.001 

80085.001 

80086.001 

80087.001 

80088.001 

80089.001 

80091.001 

80093.001 

80092.001 

80094.001 

80095.001 

80096.001 

80097.001 

80090.001 

80098.001 

80099.001 

80100.001 

80101.001 

80102.001 

80103.001 

80105.001 

80104.001 

80106.001 

80107.001 

80109.001 

80108.001 

80110.001 

801 1 1 .001 

80112.001 

80113.001 

80114.001 

80115.001 

91 000.001 

91001.001 

80118.001 

80120.001 

80121.001 

80115.002 

80122.001 

80123.001 

80124.001 

80125.001 

80126.001 

80127.001 



1/19/99 
1/21/99 
1/21/99 
1/22/99 
1/22/99 
1/28/99 
2/3/99 
2/4/99 
2/5/99 
2/8/99 
2/9/99 
2/1 0/99 
2/1 2/99 
2/16/99 
2/16/99 
2/18/99 
2/22/99 
2/23/99 
2/24/99 
2/25/99 
2/25/99 
2/26/99 
3/1 /99 
3/2/99 
3/3/99 
3/4/99 
3/5/99 
3/5/99 
3/9/99 
3/9/99 
3/1 0/99 
3/1 0/99 
3/1 1 /99 
3/1 2/99 
3/23/99 
3/25/99 
3/29/99 
4/1 /99 
4/6/99 
4/8/99 
4/1 6/99 
4/1 9/99 
4/21 /99 
4/23/99 
4/23/99 
4/28/99 
4/30/99 
4/30/99 
5/4/99 
5/5/99 



60/1 1 6,340 
60/1 1 6,674 
60/1 1 6,672 
60/1 1 6,962 
60/1 1 6,960 
60/1 1 7,756 
60/1 1 8,672 
60/1 1 8,808 
60/118,778 
60/1 1 9,029 
60/1 1 9,332 
60/119,462 
60/119,922 
60/120,198 
60/1 20,196 
60/1 20,583 
60/121,072 
60/121,334 
60/121,470 
60/121,825 
60/121,704 
60/122,107 
60/1 22,266 
60/1 22,568 
60/122,611 
60/121,775 
60/123,180 
60/1 23,534 
60/1 23,680 
60/123,548 
60/1 23,726 
60/123,715 
60/124,263 
60/123,941 
60/125,788 
60/1 26,264 
60/1 26,785 
60/1 27,462 
60/128,234 
60/128,714 
60/129,845 
60/1 30,077 
60/1 30,449 
60/130,510 
60/130,891 
60/131,449 
60/132,407 
60/132,048 
60/1 32,484 
60/1 32,485 
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Client No> 



United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 



2750-0428 P 

2750-0429P 

2750-0430P 

2750-0431 P 

2750-0433P 

2750-0432P 

2750-0434P 

2750-0435 P 

2750-0436P 

2750-0437P 

2750-0438 P 

2750-0439P 

2750-0440 P 

2750-0441 P 

2750-0442P 

2750-0444P 

2750-0445P 

2750-0446P 

2750-0447 P 

2750-0449P 

2750-0450P 

2750-0458P 

2750-0457P 

2750-0463 P 

2750-0462 P 

2750-0461 P 

2750-0464P 

2750-0454P 

2750-0460P 

2750-0466P 

2750-0465P 

2750-0459P 

2750-0455P 

2750-0453P 

2750-0452P 

2750-0451 P 

2750-0448P 

2750-0443P 

2750-0456P 

2750-0467P 

2750-0468P 

2750-0469P 

2750-0470P 

2750-0471 P 

2750-0472P 

2750-0473P 

2750-0474P 

2750-0475P 

2750-0476P 

2750-0477P 



91002.001 

80129.001 

80130.001 

80131.001 

00025.001 

91006.001 

80116.001 

80117.001 

91007.001 

91008.001 

91009.001 

91010.001 

91011.001 

91012.001 

91013.001 

91014.001 

91015.001 

91016.001 

91017.001 

91018.001 

91019.001 

00033.002 

00033.001 

00034.001 

80132.012 

80132.011 

00037.001 

80132.006 

80132.010 

00039.001 

00038.001 

80132.009 

80132.007 

80132.005 

80132.004 

80132.003 

80132.002 

80132.001 

80132.008 

00042.001 

00043.001 

00044.001 

00042.002 

00045.001 

00046.001 

00048.001 

00049.001 

00050.001 

00051.001 

00052.001 
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5/6/99 
5/6/99 
5/7/99 
5/11/99 
5/1 4/99 
5/1 4/99 
5/14/99 
5/1 4/99 
5/1 8/99 
5/1 9/99 
5/20/99 
5/21/99 
5/24/99 
5/25/99 
5/27/99 
5/28/99 
6/1/99 
6/3/99 
6/4/99 
6/7/99 
6/8/99 
6/1 0/99 
6/1 0/99 
6/1 4/99 
6/1 6/99 
6/1 6/99 
6/1 7/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/1 8/99 
6/18/99 
6/1 8/99 
6/21/99 
6/22/99 
6/23/99 
6/23/99 
6/24/99 
6/28/99 
6/29/99 
6/30/99 
7/1/99 
7/1/99 
7/2/99 



60/1 32,487 
60/1 32,486 
60/1 32,863 
60/1 34,256 
60/134,221 
60/1 34,370 
60/134,219 
60/134,218 
60/134,768 
60/134,941 
60/135,124 
60/1 35,353 
60/1 35,629 
60/136,021 
60/136,392 
60/1 36,782 
60/1 37,222 
60/137,528 
60/1 37,502 
60/1 37,724 
60/138,094 
60/138,847 
60/1 38,540 
60/139,119 
60/1 39,452 
60/139,453 
60/139,492 
60/1 39.457 
60/1 39,455 
60/1 39,750 
60/1 39,763 
60/139,463 
60/139,460 
60/139,462 
60/139,461 
60/1 39,459 
60/1 39,454 
60/1 39,458 
60/1 39,456 
60/139,817 
60/139,899 
60/140,354 
60/140,353 
60/1 40,695 
60/1 40,823 
60/140,991 
60/1 41 ,287 
60/1 41 ,842 
60/142,154 
60/142,055 
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Country 



United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
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Client No, 



2750-0478 P 

2750-0479P 

2750-0480P 

2750-0481 P 

2750-0482 P 

2750-0489P 

2750-0490P 

2750-0486P 

2750-0485P 

2750-0494P 

2750-0497P 

2750-0488 P 

2750-0495 P 

2750-0492 P 

2750-0496P 

2750-0499P 

2750-0500 P 

2750-0502 P 

2750-0483 P 

2750-0484P 

2750-0503 P 

2750-0491 P 

2750-0493 P 

2750-0487P 

2750-0504P 

2750-0505P 

2750-0498P 

2750-0501 P 

2750-0506P 

2750-0508P 

2750-0509P 

2750-0507P 

2750-051 OP 

2750-051 1P 

2750-051 2P 

2750-051 3P 

2750-051 4P 

2750-051 7P 

2750-051 5P 

2750-051 8P 

2750-051 9P 

2750-051 6P 

2750-0520P 

2750-0523P 

2750-0521 P 

2750-0522P 

2750-0524P 

2750-0526P 

2750-0527P 

2750-0528P 



00053.001 

00054.001 

00058.001 

00059.001 

00080.001 

00061 .001 

00062.001 

80134.004 

80134.003 

80134.010 

00064.001 

801 34.006 

80134.013 

80134.008 

80134.014 

80134.012 

00065.001 

80135.002 

80134.001 

80134.002 

00066.001 

80134.007 

80134.009 

80134.005 

00067.001 

00069.001 

80134.011 

80135.001 

00070.001 

80136.002 

00071 .001 

80136.001 

00072.001 

80137.001 

80137.002 

00073.001 

00074.001 

80138.002 

00076.001 

00077.001 

80136.003 

80138.001 

00079.001 

80139.002 

00080.001 

80139.001 

00081.001 

80141.002 

80141.003 

80141.004 



Fil^ i Aopiication fto^ 



7/6/99 
7/8/99 
7/9/99 
7/1 2/99 
7/1 3/99 
7/1 4/99 
7/1 5/99 
7/1 6/99 
7/1 6/99 
7/1 9/99 
7/1 9/99 
7/1 9/99 
7/1 9/99 
7/1 9/99 
7/1 9/99 
7/20/99 
7/20/99 
7/20/99 
7/21/99 
7/21/99 
7/21/99 
7/22/99 
7/22/99 
7/22/99 
7/22/99 
7/23/99 
7/23/99 
7/23/99 
7/26/99 
7/27/99 
7/27/99 
7/27/99 
7/28/99 
8/2/99 
8/2/99 
8/2/99 
8/3/99 
8/4/99 
8/4/99 
8/5/99 
8/5/99 
8/6/99 
8/6/99 
8/9/99 
8/9/99 
8/1 0/99 
8/1 1 /99 
8/1 2/99 
8/12/99 
8/12/99 



60/142,390 
60/142,803 
60/1 42,920 
60/1 42,977 
60/143,542 
60/1 43,624 
60/1 44,005 
60/144,085 
60/144,086 
60/1 44,333 
60/1 44,325 
60/144,332 
60/144,335 
60/144,331 
60/1 44,334 
60/1 44,352 
60/144,632 
60/144,884 
60/145,088 
60/1 45,086 
60/144,814 
60/145,085 
60/145,087 
60/145,089 
60/145,192 
60/145,218 
60/145,145 
60/145,224 
60/145,276 
60/145,919 
60/145,913 
60/145,918 
60/145,951 
60/146,388 
60/146,389 
60/1 46,386 
60/147,038 
60/1 47,302 
60/1 47,204 
60/147,260 
60/147,192 
60/147,303 
60/1 47,416 
60/1 47,935 
60/1 47,493 
60/148,171 
60/1 48,31 9 
60/148,342 
60/1 48,340 
60/148,337 
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Comitrv 



United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
International 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
International 
International 
United States 
United States 
International 
United States 
United States 
United States 
United States 
United States 
United States 
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orient No. 



2750-0530 P 

2750-0525P 

2750-0529P 

2750-0532P 

2750-0531 P 

2750-0534P 

2750-0537P 

2750-0536P 

2750-0535 P 

2750-0533P 

2750-0538P 

2750-0539P 

2750-0541 P 

2750-0542P 

2750-0540P 

2750-0543P 

2750-0544P 

2750-0547P 

2750-0548P 

2750-0546P 

2750-0545P 

2750-0549P 

2750-0552P 

2750-0553P 

2750-0550P 

2750-0551 F(PC) 

2750-0554 P 

2750-0555P 

2750-0556P 

2750-0557P 

2750-0558P 

2750-0559P 

2750-0560P 

2750-0561 P 

2750-0562P 

2750-0563P 

2750-0564P 

2750-0570P 

2750-0571 P 

2750-0568F{PC) 

2750-0569F(PC) 

2750-0565P 

2750-0566P 

2750-0567F(PC) 

2750-0572P 

2750-0575P 

2750-0576P 

2750-0577P 

2750-0583P 

2750-0579P 



00082.001 

80141.001 

00083.001 

80142.002 

80142.001 

80001 .003 

00084.001 

80001 .005 

80001 .004 

80001 .002 

00085.001 

00086.001 

80143.002 

00087.001 

80143.001 

00088.001 

00089.001 

00090.001 

00091 .001 

80144.002 

80144.001 

00092.001 

00093.001 

00094.001 

80001 .006 

80001.100 

00095.001 

00096.001 

00098.001 

00099.001 

00101.001 

00102.001 

00103.001 

00104.001 

00105.001 

00106.001 

00107.001 

00108.001 

00109.001 

80010.101 

80010.102 

80010.002 

80010.003 

80010.100 

00110.001 

00111.001 

00112.001 

00113.001 

80148.002 

80146.002 



Filed I Appiication tto. 



8/1 2/99 
8/12/99 
8/1 3/99 
8/1 3/99 
8/1 6/99 
8/1 7/99 
8/1 7/99 
8/1 7/99 
8/1 7/99 
8/1 7/99 
8/1 8/99 
8/20/99 
8/20/99 
8/20/99 
8/23/99 
8/23/99 
8/25/99 
8/26/99 
8/27/99 
8/27/99 
8/27/99 
8/30/99 
8/31/99 
9/1/99 
9/3/99 
9/3/99 
9/7/99 
9/10/99 
9/1 3/99 
9/1 5/99 
9/1 6/99 
9/20/99 
9/22/99 
9/23/99 
9/24/99 
9/28/99 
9/29/99 
1 0/4/99 
1 0/5/99 
1 0/5/99 
1 0/5/99 
1 0/5/99 
1 0/5/99 
1 0/5/99 
1 0/6/99 
1 0/7/99 
1 0/8/99 
1 0/1 2/99 
1 0/1 3/99 
1 0/1 3/99 



60/148,341 
60/148,347 
60/1 48,565 
60/1 48,684 
60/149,368 
60/1 49,928 
60/149,175 
60/1 49,925 
60/149,926 
60/1 49,927 
60/1 49,426 
60/1 49,722 
60/149,929 
60/149,723 
60/149,930 
60/1 49,902 
60/1 50,566 
60/1 50,884 
60/151,080 
60/151,066 
60/151,065 
60/1 51 ,303 
60/151,438 
60/1 51 ,930 
09/391 ,631 
99/204,38 
60/1 52,363 
60/1 53,070 
60/1 53,758 
60/1 54,01 8 
60/1 54,039 
60/154,779 
60/155,139 
60/1 55,486 
60/155,659 
60/1 56,458 
60/1 56,596 
60/157,117 
60/1 57,753 
99/228,54 
99/228,53 
09/413,198 
09/412,922 
99/228,55 
60/1 57,865 

60/158,029 
60/1 58,232 
60/158,369 
60/159,294 
60/159,293 
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Country f Attorney No. I CHentHo. i Fiied I AppHcatlon No. 



United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 



2750-0574P 
2750-0573 P 
2750-0580 P 
2750-0581 P 
2750-0582P 

2750-0578P 

2750-0584 P 

2750-0590P 

2750-0589P 

2750-0588P 

2750-0587 P 

2750-0586P 

2750-0585P 

2750-0591 P 

2750-0592P 

2750-0593P 

2750-0594P 

2750-0595P 

2750-0596P 

2750-0599P 

2750-0597P 

2750-0598P 

2750-0603P 

2750-0602P 

2750-0601 P 

2750-0600P 

2750-0605P 

2750-0606P 

2750-0604P 

2750-0607P 

2750-0608P 

2750-0609P 

2750-061 OP 

2750-061 1 P 

2750-061 2P 

2750-061 5P 

2750-061 3P 

2750-061 4P 

2750-061 6P 

2750-061 7P 

2750-061 8P 

2750-0620P 

2750-061 9P 

2750-0621 P 

2750-0625P 

2750-0623P 

2750-0628P 

2750-0629P 

2750-0627P 

2750-0626P 



80145.002 
80145.001 
80147.001 
801 47.002 
80148.001 
80146.001 
00116.001 
80150.002 
80150.001 
00119.001 
80149.002 
80149.001 
00118.001 
00120.001 
80151.001 
80151.002 
00121.001 
80152.001 
80152.002 
80153.002 
00122.001 
80153.001 
80154.002 
80154.001 
00123.001 
80026.002 
80155.001 
80155.002 
00124.001 
00125.001 
80156.001 
80156.002 
00126.001 
80157.001 
80157.002 
80158.002 
00127.001 
80158.001 
00128.001 
80159.001 
80159.002 
80160.001 
00129.001 
80160.002 
80162.002 
80161.002 
00131.001 
80164.001 
80163.002 
80163.001 



1 0/1 3/99 
1 0/1 4/99 
1 0/1 4/99 
1 0/1 4/99 
1 0/1 4/99 
1 0/1 4/99 
1 0/1 8/99 
10/21/99 
10/21/99 
10/21/99 
10/21/99 
10/21/99 
10/21/99 
1 0/22/99 
1 0/22/99 
1 0/22/99 
1 0/25/99 
1 0/25/99 
1 0/25/99 
1 0/26/99 
1 0/26/99 
1 0/26/99 
1 0/28/99 
1 0/28/99 
1 0/28/99 
1 0/28/99 
1 0/29/99 
1 0/29/99 
1 0/29/99 
11/1/99 
11/1/99 
11/1/99 
1 1 /2/99 
11/2/99 
11/2/99 
11/3/99 
11/3/99 
11/3/99 
11/4/99 
11/4/99 
11/4/99 
11/8/99 
11/8/99 
11/8/99 
11/9/99 
11/9/99 
11/10/99 
11/10/99 
1 1 /1 0/99 
11/10/99 



60/159,295 
60/1 59,330 
60/1 59,638 
60/1 59,637 
60/159,329 
60/159,331 
60/1 59,584 
60/1 60,767 
60/1 60,768 
60/1 60,741 
60/1 60,770 
60/160,814 
60/160,815 
60/1 60,980 
60/1 60,989 
60/1 60,981 
60/161,405 
60/1 61 .406 
60/1 61 ,404 
60/1 61 ,359 
60/1 61 ,361 
60/1 61 ,360 
60/1 61 ,993 
60/1 61 ,992 
60/1 61 ,920 
09/428,944 
60/162,142 
60/162,228 
60/162,143 
60/162,894 
60/1 62,891 
60/162,895 
60/163,093 
60/163,092 
60/163,091 
60/163,281 
60/163,249 
60/163,248 
60/1 63,379 
60/1 63,381 
60/1 63,380 
60/1 64,151 
60/164,146 
60/164,150 
60/1 64,259 
60/164,260 
60/1 64,544 
60/1 64,545 
60/164,318 
60/164,321 



Attorney No. 2750-1300P 



Countrv f Attorney No. 



33 

CHent No, 



United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750- 

United States 2750- 

United States 2750- 



■0622 P 
■0630P 
■0624 P 
0631 P 
0636P 
0635P 
0634P 
0632P 
0633 P 
0637P 
0638P 
0639P 
-0642P 
-0640P 
-0641 P 
-0643P 
-0644P 
-0645P 
-0646P 
-0647P 
-0648P 
-0650P 
-0651 P 
-0649P 
-0653P 
■0652P 
-0655P 
■0658P 
-0654P 
■0656P 
•0657P 
■0659P 
■0660P 
0661 P 
0665P 
0662P 
0664P 
0663P 
0668P 
0667P 
0666P 
0669P 
0670P 
0671 P 
0672P 
0673P 
0674P 
0676P 
0675P 
0678P 



80161.001 

801 64.002 

80162.001 

00132.001 

80166.002 

80166.001 

00133.001 

80165.001 

80165.002 

00134.001 

80167.001 

80167.002 

80168.002 

00135.001 

80168.001 

00136.001 

80169.001 

80169.002 

00137.001 

80170.001 

80170.002 

80171.001 

801 71 .002 

00139.001 

80172.001 

00140.001 

80173.002 

80174.002 

80173.001 

00141.001 

80174.001 

00142.001 

80175.001 

80175.002 

80176.002 

80042.002 

80176.001 

00143.001 

80177.002 

80177.001 

00144.001 

00145.001 

80178.001 

80178.002 

00147.001 

80179.001 

80179.002 

80180.002 

80180.001 

80181.001 



Filed I ADpHcatlom tto. 



11/10/99 
11/10/99 
11/10/99 
11/12/99 
11/12/99 
11/12/99 
11/12/99 
11/12/99 
11/12/99 
11/15/99 
11/15/99 
11/15/99 
11/16/99 
11/16/99 
11/16/99 
11/17/99 
11/17/99 
11/17/99 
11/18/99 
11/18/99 
1 1 /1 8/99 
1 1 /1 9/99 
11/19/99 
11/19/99 
11/22/99 
11/22/99 
11/23/99 
1 1 /24/99 
1 1 /24/99 
1 1 /24/99 
1 1 /24/99 
11/30/99 
11/30/99 
11/30/99 
1 2/1 /99 
1 2/1 /99 
1 2/1 /99 
12/1/99 
1 2/2/99 
1 2/2/99 
1 2/2/99 
1 2/3/99 
1 2/3/99 
1 2/3/99 
1 2/7/99 
1 2/7/99 
1 2/7/99 
1 2/8/99 
1 2/8/99 
1 2/1 6/99 



60/164,319 
60/1 64,548 
60/164,317 
60/1 64,961 
60/1 64,962 
60/1 64,959 
60/1 64,870 
60/1 64,871 
60/1 64,960 
60/1 64,927 
60/1 64,929 
60/1 64,926 
60/1 65,661 
60/1 65,669 
60/1 65,671 
60/165,919 
60/1 65,91 8 
60/165,911 
60/166,157 
60/166,173 
60/166,158 
60/166,411 
60/166,412 
60/166,419 
60/1 66,750 
60/1 66,733 
60/1 67,362 
60/167,235 
60/167,382 
60/1 67,233 
60/1 67,234 
60/1 67,904 
60/1 67,908 
60/1 67,902 
60/168,231 
09/451 ,320 
60/168,233 
60/1 68,232 
60/1 68,548 
60/1 68,549 
60/168,546 
60/168,675 
60/1 68,673 
60/1 68,674 
60/1 69,298 
60/169,278 
60/1 69,302 
60/169,691 
60/169,692 
60/171,114 
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United States 
United States 
United States 
United States 
International 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
Europe 
Mexico 
Canada 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 



ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 



2750-0679P 

2750-0677P 

2750-0683 P 

2750-0684P 

2750-0686F(PC) 

2750-0688P 

2750-0681 P 

2750-0685 P 

2750-0689P 

2750-0691 P 

2750-0687P 

2750-0682 P 

2750-0680 P 

2750-0690P 

2750-0692P 

2750-0693P 

2750-0695P 

2750-0696P 

2750-0697P 

2750-0698P 

2750-0694P 

2750-0700P 

2750-0699P 

2750-0701 P 

2750-0702P 

2750-0703P 

2750-0704P 

2750-0705P 

2750-0706P 

2750-0707P 

2750-071 3P 

2750-0708P 

2750-071 2P 

2750-071 4P 

2750-071 5P 

2750-071 7P 

2750-071 6P 

2750-0709F(EP) 

2750-0709F(MX) 

2750-0709F{CA) 

2750-071 9P 

2750-071 8P 

2750-0709P 

2750-0720P 

2750-0721 P 

2750-0722P 

2750-0723P 

2750-0724P 

2750-071 OP 

2750-0725P 



80181.002 

00149.001 

80060.002 

80070.002 

80070.100 

80184.002 

80182.002 

80183.002 

00152.001 

80185.001 

80184.001 

80183.001 

80182.001 

00153.001 

00155.001 

80186.001 

00157.001 

80187.001 

00158.001 

80188.001 

80084.002 

80189.001 

00159.001 

00160.001 

80190.001 

00161.001 

80191.001 

00162.001 

80192.001 

00163.001 

80194.001 

80193.001 

00164.001 

00165.001 

80195.001 

80196.001 

00167.001 

80090.103 

80090.101 

80090.102 

00168.001 

91022.001 

80090.002 

80197.001 

91023.001 

00169.001 

80198.001 

91024.001 

80100.002 

00170.001 



1 2/1 6/99 
1 2/1 6/99 
1/4/00 
1/7/00 
1/7/00 
1/19/00 
1/19/00 
1/19/00 
1/26/00 
1/27/00 
1/27/00 
1/27/00 
1/27/00 
1/27/00 
1/28/00 
1/28/00 
2/1/00 
2/1/00 
2/3/00 
2/3/00 
2/3/00 
2/4/00 
2/4/00 
2/7/00 
2/7/00 
2/9/00 
2/9/00 
2/1 0/00 
2/1 0/00 
2/15/00 
2/15/00 
2/1 5/00 
2/1 5/00 
2/17/00 
2/17/00 
2/24/00 
2/24/00 
2/25/00 
2/25/00 
2/25/00 
2/25/00 
2/25/00 
2/25/00 
2/25/00 
2/28/00 
2/28/00 
2/28/00 
2/29/00 
3/1 /OO 
3/1 /OO 



60/171,098 
60/171,107 
09/478,081 
09/479,221 
00/004,66 
60/1 76.91 0 
60/1 76,866 
60/1 76,867 
60/178,166 
60/1 77,666 
60/1 78,545 
60/1 78,546 
60/1 78,544 
60/1 78,547 
60/1 78,754 
60/1 78,755 
60/1 79,395 
60/1 79,388 
60/1 80,039 
60/180,139 
09/497,191 
60/1 80,207 
60/1 80,206 
60/1 80,695 
60/180,696 
60/1 81 ,228 
60/181,214 
60/181,476 
60/181,551 
60/1 82,477 
60/1 82,478 
60/182,516 
60/182,512 
60/183,166 
60/183,165 
60/1 84,658 
60/184,667 
00/301 ,439 
00/001 ,973 
23/006,92 
60/185,118 
60/1 85,140 
09/513,996 
60/185,119 
60/185,398 
60/1 85,396 
60/1 85,397 
60/1 85,750 
09/517,537 
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Couiitrv 1 Attorney No. I CilentNo 



United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 



2750-0726P 

2750-0727P 

2750-071 1 P 

2750-0728 P 

2750-0729P 

2750-0730 P 

2750-0731 P 

2750-0732P 

2750-0733 P 

2750-0735P 

2750-0734 P 

2750-0736P 

2750-0737P 

2750-0738P 

2750-0739P 

2750-0743P 

2750-0742P 

2750-0740P 

2750-0741 P 

2750-0744P 

2750-0745P 

2750-0746P 

2750-0747P 

2750-0748P 

2750-0749P 

2750-0753P 

2750-0756P 

2750-0757P 

2750-0750P 

2750-0752P 

2750-0754P 

2750-0751 P 

2750-0755P 

2750-0759P 

2750-0762P 

2750-0761 P 

2750-0758P 

2750-0760P 

2750-0764P 

2750-0763P 

2750-0767P 

2750-0766P 

2750-0765P 

2750-0769P 

2750-0770P 

2750-0768P 

2750-0772P 

2750-0773P 

2750-0771 P 

2750-0774P 



80199.001 

91025.001 

00171 .001 

80200.001 

00172.001 

80201 .001 

91026.001 

00173.001 

80202.001 

91027.001 

00174.001 

00175.001 

80203.001 

91028.001 

00177.001 

80205.001 

00178.001 

80204.001 

91030.001 

91031.001 

00179.001 

80206.001 

91032.001 

00180.001 

80207.001 

8021 1 .001 

80212.001 

91034.001 

80208.001 

80210.001 

91033.001 

80209.001 

00181.001 

80213.001 

80214.001 

00183.001 

00182.001 

91035.001 

80215.001 

00184.001 

80216.001 

00185.001 

91036.001 

00186.001 

80217.001 

91037.001 

00187.001 

80218.001 

91038.001 

91039.001 



Fiied I Appiicatlon Na 



3/1/00 
3/1/00 
3/2/00 
3/2/00 
3/2/00 
3/2/00 
3/3/00 
3/3/00 
3/3/00 
3/7/00 
3/7/00 
3/8/00 
3/8/00 
3/9/00 
3/1 0/00 
3/1 0/00 
3/1 0/00 
3/1 0/00 
3/1 0/00 
3/1 3/00 
3/1 4/00 
3/14/00 
3/15/00 
3/1 5/00 
3/1 5/00 
3/1 6/00 
3/1 6/00 
3/1 6/00 
3/16/00 
3/1 6/00 
3/16/00 
3/16/00 
3/1 6/00 
3/20/00 
3/20/00 
3/20/00 
3/20/00 
3/20/00 
3/22/00 
3/22/00 
3/23/00 
3/23/00 
3/23/00 
3/24/00 
3/24/00 
3/24/00 
3/27/00 
3/27/00 
3/27/00 
3/29/00 



60/1 86,296 
60/1 86,277 
60/1 86,390 
60/187,178 
60/1 86,386 
60/1 86,387 
60/1 86,670 
60/1 86,748 
60/1 86,669 
60/1 87,379 
60/1 87,378 
60/1 87,896 
60/1 87,888 
60/1 87,985 
60/188,187 
60/188,175 
60/188,185 
60/188,186 
60/188,174 
60/1 88,687 
60/189,080 
60/1 89,052 
60/1 89,460 
60/1 89,461 
60/189,462 
60/190,121 
60/189,959 
60/1 89,965 
60/190,120 
60/1 89,948 
60/1 89,958 
60/1 89,947 
60/1 89,953 
60/1 90,070 
60/190,089 
60/190,545 
60/190,069 
60/1 90,060 
60/191,097 
60/191,084 
60/191,545 
60/191,543 
60/191,549 
60/191,823 
60/1 91 ,825 
60/1 91 ,826 
60/192,421 
60/192,308 
60/192,420 
60/192,855 
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Courttrv 1 Attorney No. 



United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

Canada 2750- 

Europe 2750- 

United States 2750- 

United States 2750- 

Mexico 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 



■0775P 
■0776P 
■0777P 
0778P 
0779 P 
0782P 
0780P 
-0781 P 
-0785P 
-0787P 
-0786P 
-0792P 
-0791 P 

-0789P 
-0790P 
-0793P 
-0783F(CA) 
-0783F(EP) 
-0796P 
-0783P 
-0V83F(I\/IX) 
-0784P 
■0795P 
-0794P 
■0799P 
■0797P 
■0798P 
■0804P 
■0801 P 
0802P 
0803P 
0800P 
0805P 
0806P 
•0807P 
0809P 
0808P 
081 1 P 
081 OP 
081 4P 
081 6P 
081 9P 
081 2P 
081 3P 
081 7P 
081 5P 
081 8P 
0822P 
0821 P 
0820P 



Client HO. I mi^ 1 Apgll<^k>ft Ho 



00188.001 

80219.001 

91040.001 

00189.001 

80220.001 

80221 .001 

91041.001 

00190.001 

91042.001 

80222.001 

00191.001 

91044.001 

80223.001 

91043.001 

00192.001 

00193.001 

91000.102 

91000.101 

80225.001 

91000.002 

91000.100 

91045.001 

00194.001 

80224.001 

80226.001 

91046.001 

00195.001 

80228.001 

80227.002 

91047.001 

00196.001 

80227.001 

91048.001 

00197.001 

80229.001 

80230.001 

00200.001 

80231 .002 

80231.001 

91049.001 

80233.001 

80234.001 

80232.001 

80232.002 

91050.001 

00201 .001 

00202.001 

80235.001 

00203.001 

91051.001 



3/29/00 
3/29/00 
3/30/00 
3/30/00 
3/30/00 
3/31/00 
3/31/00 
3/31/00 
4/4/00 
4/4/00 
4/4/00 
4/5/00 
4/5/00 
4/5/00 
4/5/00 
4/6/00 
4/6/00 
4/6/00 
4/6/00 
4/6/00 
4/6/00 
4/6/00 
4/6/00 
4/6/00 
4/7/00 
4/7/00 
4/7/00 
4/1 1 /OO 
4/11/00 
4/1 1 /OO 
4/1 1 /OO 
4/12/00 
4/1 2/00 
4/1 2/00 
4/1 2/00 
4/1 2/00 
4/1 2/00 
4/1 3/00 
4/1 4/00 
4/1 4/00 
4/1 7/00 
4/1 7/00 
4/1 7/00 
4/1 7/00 
4/1 7/00 
4/1 7/00 
4/1 7/00 
4/1 9/00 
4/1 9/00 
4/19/00 



60/1 92,940 
60/1 92,941 
60/1 93,243 
60/193,244 
60/193,245 
60/1 93,455 
60/1 93,469 
60/1 93,453 
60/194,385 
60/194,398 
60/1 94,404 
60/1 94,698 
60/194,697 
60/1 94,682 
60/194,683 
60/194,874 
02/302,828 
00/302,919 
60/195,045 
09/543,680 
00/003,391 
60/194,884 
60/1 94,885 
60/1 94,872 
60/195,257 
60/195,258 
60/195,283 
60/1 96,089 
60/196,211 
60/196,168 
60/196,169 
60/196,212 
60/1 96,483 
60/1 96,487 
60/1 96,289 
60/1 96,486 
60/1 96,485 
60/1 96,21 3 

60/1 97,397 
60/197,678 
60/197,671 
60/197,870 
60/197,871 
60/198,268 
60/197,687 
60/198,133 
60/198,373 
60/198,386 
60/198,400 
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United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
Mexico 
United States 
United States 
United States 
United States 
United States 
Europe 
Canada 



Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 
Un 



ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 
ted States 



2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750- 
2750 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750- 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 
2750 



-0825P 
-0824P 
-0823P 
■0826P 
•0828P 
0827P 
0829P 
0830P 
0831 P 
0832P 
-0834P 
-0835P 
-0833P 
-0836P 
-0837P 
-0788P 
-0846P 
-0844P 
-0841 P 
-0842P 
-0843P 
-0847P 
-0845P 
-0839P 
-0840P 
-0848P 
•0849P 
■0850P 
-0838P 
•0851 F(MX) 
■0857P 
■0858P 
•0856P 
■0852P 
■0859P 
0851 F(EP) 
0851 F(CA) 
0860P 
0851 P 
0861 P 
0855 P 
0854P 
0853P 
0865P 
0863P 
0864P 
0866P 
0862P 
0877P 
0879P 



80236.001 

00204.001 

91052.001 

91053.001 

80237.001 

00206.001 

91054.001 

00207.001 

80238.001 

92001.001 

00208.001 

80239.001 

92002.001 

00210.001 

80240.001 

80123.002 

80243.002 

80242.002 

92001 .002 

92002.002 

80242.001 

80244.001 

80243.001 

80241 .001 

91055.001 

80244.002 

91056.001 

80245.001 

0021 1 .001 

91002.102 

00212.001 

80246.001 

91057.001 

80126.002 

91058.001 

91002.101 

91002.100 

00213.001 

91002.002 

80247.001 

80130.002 

80129.002 

80127.002 

0021 5.001 

80248.001 

91059.001 

80249.001 

0021 4.001 

91060.001 

80252.001 



4/20/00 
4/20/00 
4/20/00 
4/21/00 
4/21/00 
4/21/00 
4/24/00 
4/24/00 
4/24/00 
4/26/00 
4/26/00 
4/26/00 
4/26/00 
4/27/00 
4/27/00 
4/28/00 
4/28/00 
4/28/00 
5/1/00 
5/1 /OO 
5/1 /OO 
5/1 /OO 
5/1 /OO 
5/1 /OO 
5/1 /OO 
5/1/00 
5/2/00 
5/2/00 
5/2/00 
5/4/00 
5/4/00 
5/4/00 
5/4/00 
5/4/00 
5/5/00 
5/5/00 
5/5/00 
5/5/00 
5/5/00 
5/5/00 
5/5/00 
5/5/00 
5/5/00 
5/9/00 
5/9/00 
5/9/00 
5/9/00 
5/9/00 
5/1 0/00 
5/10/00 



60/1 98,623 
60/198,619 
60/198,629 
60/198,765 
60/1 98,763 
60/198,767 
60/199,123 
60/199,124 
60/199,122 
60/200,034 
60/1 99,828 
60/199,818 
60/200,031 
60/200,103 
60/200,102 
09/559,232 
60/200,773 
60/200,373 
60/201 ,01 7 
60/201,018 
60/200,763 
60/200,762 
60/201 ,01 6 
60/200,879 
60/200,885 
60/200,761 
60/201 ,279 
60/201 ,305 
60/201 ,275 
00/004,406 
60/201 ,740 
60/201 ,750 
60/201 ,751 
09/566,262 
60/202,178 
00/303,770 
02/305,695 
60/202,112 
09/565,308 
60/202,180 
09/565,310 
09/565,307 
09/565,309 
60/202,91 9 
60/202,636 
60/202,915 
60/202,634 
60/202,91 4 
60/202,969 
60/202,963 
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United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

Mexico 2750 

United States 2750 

Europe 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750- 

United States 2750- 

Canada 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

Canada 2750- 

United States 2750- 

Europe 2750- 

United States 2750- 

Mexico 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 

United States 2750 



-0878P 
0870P 
0869P 
-0871 P 
0881 P 
0880P 
0868P 
0867P 
-0882P 
-0875F(IVIX) 
-0873P 
-0875F{EP) 
-0874P 
-0875P 
-0883P 
-0884 P 
-0885P 
-0875F(CA) 
■0872P 
-0886P 
-0888P 
-0887P 
-0891 P 
-0892P 
-0894P 
■0893P 
■0890P 
■0889P 
•0876F(CA) 
-0896P 
•0876F(EP) 
•0876P 
0876F(MX) 
0895P 
0898P 
0897P 
0900P 
0899P 
0901 P 
0903P 
0902P 
0905P 
0904P 
0906P 
0908P 
091 OP 
091 1 P 
0907P 
0909P 
091 3P 



00216.001 

80251 .002 

80251 .001 

80131.002 

00217.001 

91061.001 

80250.002 

80250.001 

80253.001 

91006.102 

00025.002 

91006.101 

80116.002 

91006.002 

91062.001 

00219.001 

80254.001 

91006.100 

80117.002 

91063.001 

80255.001 

00220.001 

00221 .001 

80256.001 

80257.001 

00222.001 

92002.003 

92001 .003 

91007.100 

80258.001 

91007.101 

91 007.002 

91007.102 

00223.001 

80259.001 

00224.001 

00225.001 

91 064.001 

80260.001 

80261 .001 

00226.001 

80262.001 

00227.001 

91 065.001 

80263.001 

00229.001 

80264.001 

00228.001 

91 066.001 

00230.001 
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5/1 0/00 
5/11/00 
5/1 1 /OO 
5/1 1 /OO 
5/1 1 /OO 
5/1 1 /OO 
5/11/00 
5/11/00 
5/11/00 
5/1 2/00 
5/1 2/00 
5/12/00 
5/1 2/00 
5/12/00 
5/1 2/00 
5/1 2/00 
5/1 2/00 
5/12/00 
5/12/00 
5/15/00 
5/1 5/00 
5/1 5/00 
5/1 6/00 
5/1 6/00 
5/1 7/00 
5/17/00 
5/1 7/00 
5/1 7/00 
5/1 8/00 
5/1 8/00 
5/1 8/00 
5/1 8/00 
5/1 8/00 
5/1 8/00 
5/1 9/00 
5/1 9/00 
5/22/00 
5/22/00 
5/22/00 
5/23/00 
5/23/00 
5/24/00 
5/24/00 
5/25/00 
5/26/00 
5/26/00 
5/26/00 
5/26/00 
5/26/00 
5/30/00 



60/202,968 
60/203,622 
60/203,669 
09/572,408 
60/203,457 
60/203,458 
60/203,672 
60/203,671 
60/203,279 
00/004,628 
09/570,582 
00/304,01 7 
09/570,738 
09/570,581 
60/203,91 1 
60/203,91 6 
60/203,915 
02/306,232 

09/570,768 
60/204,395 
60/204,122 
60/204,388 
60/204,568 
60/204,569 
60/204,829 
60/204,830 
60/205,325 
60/205,233 
02/306,202 
60/205,058 
00/304,161 
09/573,655 
00/004,850 
60/205,201 
60/205,243 
60/205,242 
60/205,572 
60/205,574 
60/205,576 
60/206,319 
60/206,31 6 
60/206,545 
60/206,553 
60/206,988 
60/207,243 
60/207,239 
60/207,354 
60/207,367 
60/207,242 
60/207,452 
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United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
United States 
l\/lexico 
iVIexico 
Canada 
United States 
Europe 
United States 
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8/10/00 09/635,641 


Europe 


2750-1 005F(EP) 


00083.101 


8/11/00 


United States 


2750-1 005P 


00083.002 


8/11/00 09/637,563 


IVlexico 


2750-1 004F(MX) 


00082.102 


8/11/00 


Europe 


2750-1 004F(EP) 


00082.101 


8/11/00 


Canada 


2750-1 005F(CA) 


00083.100 


8/11/00 


Canada 


2750-1 004F(CA) 


00082.100 


8/11/00 


IVlexico 


2750-1 005F(MX) 


00083.102 


8/11/00 


United States 


2750-1 096P 


80142.004 


8/11/00 09/637,780 


IVlexico 


2750-1 003F(MX) 


00081.102 


8/11/00 


United States 


2750-11 01 P 


80141.005 


8/11/00 


United States 


2750-1 103P 


80141.007 


8/11/00 


United States 


2750-1 003P 


00081 .002 


8/11/00 09/637,837 


United States 


2750-1 004P 


00082.002 


8/11/00 09/636,555 


Europe 


2750-1 003F(EP) 


00081.101 


8/11/00 


United States 


2750-1 104P 


80141.008 


8/11/00 


United States 


2750-1 102P 


80141.006 


8/11/00 09/637,565 


Canada 


2750-1 003F(CA) 


00081.100 


8/11/00 


United States 


2750-1 082P 


80298.001 


8/14/00 


United States 


2750-1 083P 


91084.001 


8/1 4/00 


United States 


2750-1 085P 


91085.001 


8/1 5/00 


United States 


2750-1 084P 


80300.001 


8/1 5/00 


United States 


2750-1 095P 


80142.003 


8/1 6/00 09/643,672 


United States 


2750-1 1 1 0P 


80296.002 


8/1 6/00 60/225,850 


United States 


2750-1 108P 


80295.002 


8/1 6/00 


United States 


2750-1 11 2P 


80297.002 


8/1 6/00 


United States 


2750-1 106P 


80294.002 


8/16/00 


Europe 


2750-1 006F(EP) 


00084.101 


8/1 7/00 


United States 


2750-1 006P 


00084.002 


8/17/00 09/641,198 


Mexico 


2750-1 006F(MX) 


00084.102 


8/1 7/00 


Canada 


2750-1 006F(CA) 


00084.100 


8/1 7/00 


Canada 


2750-1 008F(CA) 


00086.100 


8/18/00 


Canada 


2750-1 007F(CA) 


00085.100 


8/18/00 


United States 


2750-1 007P 


00085.002 


8/18/00 


Mexico 


2750-1 007F(MX) 


00085.102 


8/1 8/00 


United States 


2750-1 1 1 1 P 


80297.001 


8/1 8/00 60/226,324 


Europe 


2750-1 008F(EP) 


00086.101 


8/1 8/00 


United States 


2750-1 008P 


00086.002 


8/18/00 


Europe 


2750-1 007F(EP) 


00085.101 


8/18/00 


United States 


2750-1 109P 


80296.001 


8/1 8/00 60/226,323 


United States 


2750-1 107P 


80295.001 


8/18/00 


mexico 


2750-1 009F{MX) 


00087.102 


8/18/00 


United States 


2750-1 098P 


80143.004 


8/1 8/00 09/643,671 


United States 


2750-1 009P 


00087.002 


8/1 8/00 09/640,695 


United States 


2750-1 105P 


80294.001 


8/1 8/00 60/226,325 
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Countrv 


Attomev No. 


Cifent No. 


Fiied AopliGStion No. 


Europe 


2750-1 009F(EP) 


00087.101 


8/1 8/00 


Mexico 


2750-1 008F{MX) 


00086.102 


8/1 8/00 


Canada 


2750-1 009F{CA) 


00087.100 


8/1 8/00 


United States 


2750-1 087P 


91086.001 


8/21/00 


United States 


2750-1 086P 


80301 .001 


8/21/00 


United States 


2750-1 1 24P 


80289.009 


8/23/00 


United States 


2750-1 1 1 7P 


80289.002 


8/23/00 


United States 


2750-1 1 1 BP 


80289.003 


8/23/00 


United States 


2750-1 122P 


80289.007 


8/23/00 


United States 


2750-1 1 31 P 


80289.01 6 


8/23/00 


United States 


2750-1 1 23P 


80289.008 


8/23/00 


United States 


2750-1 1 32P 


80289.01 7 


8/23/00 


United States 


2750-1 1 62P 


80307.001 


8/23/00 


Mexico 


2750-1 01 OF(MX) 


00088.102 


8/23/00 


Europe 


2750-1 01 OF(EP) 


00088.101 


8/23/00 


United States 


2750-1 01 OP 


00088.002 


8/23/00 09/643,854 


United States 


2750-1 116P 


80289.001 


8/23/00 


United States 


2750-1 097P 


80143.003 


8/23/00 09/649,866 


United States 


2750-1 163P 


91087.001 


8/23/00 


Canada 


2750-1 01 OF(CA) 


00088.100 


8/23/00 


United States 


2750-1 1 45P 


80289.030 


8/25/00 60/227,728 


United States 


2750-1 127P 


80289.012 


8/25/00 


United States 


2750-1 144P 


80289.029 


8/25/00 


United States 


2750-1 099P 


80144.003 


8/25/00 09/649,868 


United States 


2750-1 1 0OP 


80144.004 


8/25/00 09/649,867 


United States 


2750-1 1 40P 


80289.025 


8/25/00 


Mexico 


2750-1 01 3F(MX) 


00091.102 


8/25/00 


United States 


2750-1 1 26P 


80289.01 1 


8/25/00 


United States 


2750-11 51 P 


80289.036 


8/25/00 


United States 


2750-1 1 43P 


80289.028 


8/25/00 


United States 


2750-1 137P 


80289.022 


8/25/00 


United States 


2750-1 150P 


80289.035 


8/25/00 


United States 


2750-1 1 21 P 


80289.006 


8/25/00 


United States 


2750-1 136P 


80289.021 


8/25/00 


United States 


2750-1 1 35P 


80289.020 


8/25/00 


United States 


2750-1 1 34P 


80289.019 


8/25/00 


United States 


2750-1 1 20P 


80289.005 


8/25/00 


Mexico 


2750-1 01 2F(MX) 


00090.102 


8/25/00 


United States 


2750-1 1 33P 


80289.018 


8/25/00 


Mexico 


2750-1 01 1 F(MX) 


00089.102 


8/25/00 


Europe 


2750-1 01 2F(EP) 


00090.101 


8/25/00 


Canada 


2750-1 01 2F(CA) 


00090.100 


8/25/00 


United States 


2750-1 01 2P 


00090.002 


8/25/00 


United States 


2750-1 147P 


80289.032 


8/25/00 


United States 


2750-1 1 46P 


80289.031 


8/25/00 


United States 


2750-1 1 49P 


80289.034 


8/25/00 


Canada 


2750-1 01 3F(CA) 


00091.100 


8/25/00 


Europe 


2750-1 01 3F(EP) 


00091.101 


8/25/00 


United States 


2750-1 1 48P 


80289.033 


8/25/00 


Europe 


2750-1 01 1 F(EP) 


00089.101 


8/25/00 
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Country 


Attorney No. 


Client No. 


Fiied 


; AQDllc^km No* 



United States 2750 

United States 2750 

United States 2750 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

Canada 2750- 

United States 2750- 

Europe 2750- 

Mexlco 2750- 

United States 2750- 

United States 2750- 

Canada 2750- 

Canada 2750- 

Mexico 2750- 

United States 2750- 

Europe 2750- 

United States 2750- 

Unlted States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

Canada 2750- 

Mexico 2750- 

United States 2750- 

Europe 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

United States 2750- 

Europe 2750- 

Mexico 2750- 

Canada 2750- 

Mexico 2750- 

Unlted States 2750- 

Canada 2750- 

Europe 2750- 

Unlted States 2750- 

Mexico 2750- 

Europe 2750- 

Canada 2750- 

United States 2750- 

United States 2750- 



-1119P 
-1013P 
-1011P 
■1139P 
■1128P 
■1138P 
1142P 
1141P 
1152P 
1 01 1 F(CA) 
1 226P 
1014F{EP) 
1014F(I\/1X) 
1 225P 
1014P 
1 01 4F(CA) 
1015F(CA) 
1015F(I\/IX) 
1088P 
1015F(EP) 
1089P 
1090P 
1113P 
1091P 
1015P 
1 01 6F(CA) 
1 01 6F(IVIX) 
1016P 
1 01 6F(EP) 
1159P 
1 228P 
1160P 
1158P 
1157P 
1161P 
1227P 
1017P 
1017F(EP) 
1017F(MX) 
1 01 7F(CA) 
1018F(MX) 
1018P 
1 01 8F(CA) 
1018F(EP) 
1230P 
1 01 9F(MX) 
1019F(EP) 
1 01 9F(CA) 
1019P 
1229P 



80289.004 

00091 .002 

00089.002 

80289.024 

80289.01 3 

80289.023 

80289.027 

80289.026 

80289.037 

00089.100 

91088.001 

00092.101 

00092.102 

80308.001 

00092.002 

00092.100 

00093.100 

00093.102 

80302.001 

00093.101 

80302.002 

80303.001 

80304.001 

80303.002 

00093.002 

00094.100 

00094.102 

00094.002 

00094.101 

80305.002 

91089.001 

80306.001 

80305.001 

80304.002 

80306.002 

80309.001 

00095.002 

00095.101 

00095.102 

00095.100 

00096.102 

00096.002 

00096.100 

00096.101 

91090.001 

00098.102 

00098.101 

00098.100 

00098.002 

8031 0.001 



8/25/00 
8/25/00 
8/25/00 
8/25/00 
8/25/00 
8/25/00 
8/25/00 
8/25/00 
8/25/00 
8/25/00 
8/30/00 
8/30/00 
8/30/00 
8/30/00 
8/30/00 
8/30/00 
8/31/00 
8/31/00 
8/31/00 
8/31 /OO 
8/31 /OO 
8/31 /OO 
8/31 /OO 
8/31/00 
8/31/00 
9/1 /OO 
9/1/00 
9/1/00 
9/1/00 
9/6/00 
9/6/00 
9/6/00 
9/6/00 
9/6/00 
9/6/00 
9/6/00 
9/7/00 
9/7/00 
9/7/00 
9/7/00 
9/8/00 
9/8/00 
9/8/00 
9/8/00 
9/1 3/00 
9/1 3/00 
9/1 3/00 
9/1 3/00 
9/1 3/00 
9/13/00 



60/228,029 



60/228,167 
60/227,726 
60/227,729 
60/227,733 



09/651 ,370 



09/653,466 



09/654,547 



09/657,569 



09/660,883 
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Country 


Attomev No. 


Cllefit Ho. 


Filed Aoi^leatiOFi No. 


United States 


2750-1 021 P 


00101.002 


9/15/00 


Canada 


2750-1 021 F(CA) 


00101.100 


9/1 5/00 


Europe 


2750-1 021 F(EP) 


00101.101 


9/15/00 


United States 


2750-1 231 P 


8031 1 .001 


9/15/00 


Mexico 


2750-1 021 F(MX) 


00101.102 


9/15/00 


United States 


2750-1 020P 


00099.002 


9/15/00 


Canada 


2750-1 020F(CA) 


00099.100 


9/15/00 


Europe 


2750-1 020F(EP) 


00099.101 


9/1 5/00 


Mexico 


2750-1 020F(MX) 


00099.102 


9/1 5/00 


United States 


2750-1 232P 


91091.001 


9/1 5/00 


United States 


2750-1 233P 


80312.001 


9/1 8/00 


United States 


2750-1 234P 


91 092.001 


9/1 8/00 


United States 


2750-1 253P 


80313.001 


9/20/00 


United States 


2750-1 254P 


91 093.001 


9/20/00 


United States 


2750-1 022P 


00102.002 


9/20/00 


Canada 


2750-1 022F(CA) 


00102.100 


9/20/00 


Europe 


2750-1 022F(EP) 


00102.101 


9/20/00 


Mexico 


2750-1 022F(MX) 


00102.102 


9/20/00 


United States 


2750-1 261 P 


91 095.001 


9/21/00 


United States 


2750-1 260P 


91094.001 


9/21/00 


United States 


2750-1 259P 


80314.001 


9/21/00 


United States 


2750-1 023 P 


00103.002 


9/22/00 


Canada 


2750-1 025 F{CA) 


00105.100 


9/22/00 


Europe 


2750-1 023F(EP) 


00103.101 


9/22/00 


United States 


2750-1 025 P 


00105.002 


9/22/00 


Canada 


2750-1 023 F(CA) 


00103.100 


9/22/00 


Mexico 


2750-1 024F(MX) 


00104.102 


9/22/00 


Europe 


2750-1 024F(EP) 


00104.101 


9/22/00 


Canada 


2750-1 024F(CA) 


00104.100 


9/22/00 


United States 


2750-1 024P 


00104.002 


9/22/00 


Mexico 


2750-1 025F(MX) 


00105.102 


9/22/00 


Europe 


2750-1 025F(EP) 


00105.101 


9/22/00 


Mexico 


2750-1 023F(MX) 


00103.102 


9/22/00 


United States 


2750-1 264P 


80316.001 


9/25/00 


United States 


2750-1 265P 


91097.001 


9/25/00 


United States 


2750-1 263P 


91 096.001 


9/25/00 


United States 


2750-1 262 P 


80315.001 


9/25/00 


United States 


2750-1 266P 


80317.001 


9/26/00 


United States 


2750-1 267P 


91 098.001 


9/27/00 


United States 


2750-1 026P 


00106.002 


9/28/00 


United States 


2750-1 268P 


91 078.001 


9/28/00 


United States 


2750-1 269P 


91 099.001 


9/29/00 


United States 


2750-1 027P 


00107.002 


9/29/00 


United States 


2750-1 270P 


80318.001 


1 0/2/00 


United States 


2750-1 271 P 


91100.001 


1 0/2/00 


United States 


2750-1 028P 


00108.002 


1 0/4/00 


United States 


2750-1 272P 


8031 9.001 


1 0/4/00 


United States 


2750-1 273P 


91101.001 


1 0/4/00 


United States 


2750-1 274P 


80320.001 


1 0/5/00 


United States 


2750-1 029P 


00109.002 


1 0/5/00 
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Attomev No. 


Cii6iit No. 


Filed AoDHcatlorn No. 


United 


States 


2750-1 030P 


00110.002 


1 0/6/00 


United 


States 


2750-1 031 P 


00111.002 


1 0/6/00 


United 


States 


2750-1 032P 


00112.002 


1 0/6/00 


United 


States 


2750-1 277P 


80322.001 


10/10/00 


United 


States 


2750-1 276P 


91102.001 


10/10/00 


United 


States 


2750-1 275P 


80321 .001 


1 0/1 0/00 


United 


States 


2750-1 278P 


91103.001 


1 0/1 0/00 


United 


States 


2750-1 280P 


91104.001 


1 0/1 1 /oo 


United 


States 


2750-1 279P 


80323.001 


1 0/1 1 /oo 


United 


States 


2750-1 033P 


00113.002 


1 0/1 2/00 


United 


States 


2750-1 281 P 


80324.001 


1 0/1 7/00 


United 


States 


2750-1 282P 


91105.001 


1 0/1 7/00 


United 


States 


2750-1 034P 


00116.002 


1 0/1 8/00 



All applications listed in the table above are expressly incorporated herein by 
reference in their entirety and for all purposes. 

The SDFs of the invention can also be used as probes to search for genes that are 
related to the SDF within a species. Such related genes are typically considered to be 
members of a gene family. In such a case, the sequence similarity will often be concentrated 
into one or a few fragments of the sequence. The fragments of similar sequence that define 
the gene family typically encode a fragment of a protein or RNA that has an enzymatic or 
structural function. The percentage of identity in the amino acid sequence of the domain that 
defines the gene family is preferably at least 70%, more preferably 80 to 95%, most 
preferably 85 to 99%. To search for members of a gene family within a species, a low 
stringency hybridization is usually performed, but this will depend upon the size, distribution 
and degree of sequence divergence of domains that define the gene family. SDFs 
encompassing regulatory regions can be used to identify coordinately expressed genes by 
using the regulatory region sequence of the SDF as a probe. 

In the instances where the SDFs are identified as being expressed from genes that confer 
a particular phenotype, then the SDFs can also be used as probes to assay plants of different 
species for those phenotypes. 

I.e. Methods to Inhibit Gene Expression 
2 0 The nucleic acid molecules of the present invention can be used to inhibit gene 

transcription and/or translation. Example of such methods include, without limitation: 
Antisense Constructs; 
Ribozyme Constructs; 




1 

Attorney No. 2750-1300P 

49 

Chimeraplast Constructs; 
Co-Suppression; 
Transcriptional Silencing; and 
Other Methods of Gene Expression. 



C.l Antisense 

In some instances it is desirable to suppress expression of an endogenous or 
exogenous gene. A well-known instance is the FLAVOR-SAVOR^^ tomato, in which the 

1 0 gene encoding ACC synthase is inactivated by an antisense approach, thus delaying softening 
of the fruit after ripening. See for example, U.S. Patent No. 5,859,330; U.S. Patent No. 
5,723,766; Oeller, et al. Science, 254:437-439(1991); and Hamilton et Nature, 346:284- 
287 (1990), Also, timing of flowering can be controlled by suppression of the FLOWERING 
LOCUS C (FLC); high levels of this transcript are associated with late flowering, while 

15 absence of FLC is associated with early flowering (S.D, Michaels et al.. Plant Cell 11:949 
(1999). Also, the transition of apical meristem from production of leaves with associated 
shoots to flowering is regulated by TERMINAL FLOWERl, APETALAl andLEAFF. Thus, 
when it is desired to induce a transition from shoot production to flowering, it is desirable to 
suppress TFLl expression (S J. Liljegren, Plant Cell 11:1007 (1999)). As another instance, 

2 0 arrested ovule development and female sterility result from suppression of the ethylene 

forming enzyme but can be reversed by application of ethylene (D. De Martinis et al., Plant 
Cell 11:1061 (1999)). The ability to manipulate female fertility of plants is useful in 
increasing fruit production and creating hybrids. 

In the case of polynucleotides used to inhibit expression of an endogenous gene, the 

2 5 introduced sequence need not be perfectly identical to a sequence of the target endogenous gene. 

The introduced polynucleotide sequence will typically be at least substantially identical to the 
target endogenous sequence. 

Some polynucleotide SDFs in Table 1 represent sequences that are expressed in 
corn,wheat, rice, soybean Arabidopsis and/or other plants. Thus the invention includes using 

3 0 these sequences to generate antisense constructs to inhibit translation and/or degradation of 

transcripts of said SDFs, typically in a plant cell. 

To accomplish this, a polynucleotide segment from the desired gene that can hybridize to 
the mRNA expressed from the desired gene (the antisense segment") is operably linked to a 
promoter such that the antisense strand of RNA will be transcribed when the construct is present 
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in a host cell. A regulated promoter can be used in the construct to control transcription of the 
antisense segment so that transcription occurs only under desired circumstances. 

The antisense segment to be introduced generally will be substantially identical to at 
least a fragment of the endogenous gene or genes to be repressed. The sequence, however, need 
5 not be perfectly identical to inhibit expression. Further, the antisense product may hybridize to 
the untranslated region instead of or in addition to the coding sequence of the gene. The vectors 
of the present invention can be designed such that the inhibitory effect applies to other proteins 
within a family of genes exhibiting homology or substantial homology to the target gene. 

For antisense suppression, the introduced antisense segment sequence also need not 
10 be full length relative to either the primary transcription product or the fully processed 

mRNA. Generally, a higher percentage of sequence identity can be used to compensate for 
the use of a shorter sequence. Furthermore, the introduced sequence need not have the same 
r I intron or exon pattern, and homology of non-coding segments may be equally effective. 

J;: Normally, a sequence of between about 30 or 40 nucleotides and the full length of the 

^iiS 1 5 transcript canbe used, though a sequence of at least about 100 nucleotides is preferred, a 

sequence of at least about 200 nucleotides is more preferred, and a sequence of at least about 
f " 500 nucleotides is especially preferred, 

C^f C.2. Ribozymes 

O 20 It is also contemplated that gene constructs representing ribozymes and based on the 

^^'"^ SDFs in TABLE 1 are an object of the invention. Ribozymes can also be used to inhibit 

expression of genes by suppressing the translation of the mRNA into a polypeptide. It is 
possible to design ribozymes that specifically pair with virtually any target RNA and cleave the 
phosphodiester backbone at a specific location, thereby functionally inactivating the target RNA. 

2 5 In carrying out this cleavage, the ribozyme is not itself altered, and is thus capable of recycling 

and cleaving other molecules, making it a true enzyme. The inclusion of ribozyme sequences 
within antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the 
activity of the constructs. 

A number of classes of ribozymes have been identified. One class of ribozymes is 

3 0 derived from a number of small circular RNAs, which are capable of self-cleavage and 

replication in plants. The RNAs replicate either alone (viroid RNAs) or with a helper virus 
(satellite RNAs). Examples include RNAs from avocado sunblotch viroid and the satellite 
RNAs from tobacco ringspot virus, lucerne transient streak virus, velvet tobacco mottle virus, 



Attorney No. 2750-1300P 

51 

solanum nodiflorum mottle virus and subterranean clover mottle virus. The design and use of 
target RNA-specific ribozymes is described in Haseloff et al. Nature, 334:585 (1988). 

Like the antisense constructs above, the ribozyme sequence fragment necessary for 
pairing need not be identical to the target nucleotides to be cleaved, nor identical to the 
5 sequences in TABLE 1. Ribozymes may be constructed by combining the ribozyme 

sequence and some fragment of the target gene which v^ould allow recognition of the target 
gene mRNA by the resulting ribozyme molecule. Generally, the sequence in the ribozyme 
capable of binding to the target sequence exhibits a percentage of sequence identity with at 
least 80%, preferably with at least 85%, more preferably with at least 90% and most preferably 

1 0 with at least 95%, even more preferably, with at least 96%, 97%, 98% or 99% sequence identity 
to some fragment of a sequence in TABLE 1 or the complement thereof. The ribozyme can 
be equally effective in inhibiting mRNA translation by cleaving either in the untranslated or 
coding regions. Generally, a higher percentage of sequence identity can be used to 
compensate for the use of a shorter sequence. Furthermore, the introduced sequence need not 

1 5 have the same intron or exon pattern, and homology of non-coding segments may be equally 
effective. 

C.3. Chimeraplasts 

The SDFs of the invention, such as those described by Table 1, can also be used to 
construct chimeraplasts that can be introduced into a cell to produce at least one specific 
2 0 nucleotide change in a sequence corresponding to the SDF of the invention. A chimeraplast 
is an oligonucleotide comprising DNA and/or RNA that specifically hybridizes to a target 
region in a manner which creates a mismatched base-pair. This mismatched base-pair signals 
the celFs repair enzyme machinery which acts on the mismatched region resulting in the 
replacement, insertion or deletion of designated nucleotide(s). The altered sequence is then 

2 5 expressed by the cell's normal cellular mechanisms. Chimeraplasts can be designed to repair 

mutant genes, modify genes, introduce site-specific mutations, and/or act to interrupt or alter 
normal gene function (US Pat. Nos. 6,010,907 and 6,004,804; and PCT Pub. No. 
W099/58723 and WO99/07865). 

C.4. Sense Suppression 

The SDFs of Table 1 of the present invention are also useful to modulate gene 

3 0 expression by sense suppression. Sense suppression represents another method of gene 
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suppression by introducing at least one exogenous copy or fragment of the endogenous 

sequence to be suppressed. 

Introduction of expression cassettes in which a nucleic acid is configured in the sense 
orientation with respect to the promoter into the chromosome of a plant or by a self-replicating 
5 virus has been shown to be an effective means by which to induce degradation of mRNAs of 
target genes. For an example of the use of this method to modulate expression of endogenous 
genes see, Napoli et al., The Plant Cell 2:279 (1990), and U.S. Patents Nos. 5,034,323, 
5,231,020, and 5,283,184. Inhibition of expression may require some transcription of the 
introduced sequence. 

1 0 For sense suppression, the introduced sequence generally will be substantially identical 

to the endogenous sequence intended to be inactivated. The minimal percentage of sequence 
identity will typically be greater than about 65%, but a higher percentage of sequence identity 
might exert a more effective reduction in the level of normal gene products. Sequence identity 
of more than about 80% is preferred, though about 95% to absolute identity would be most 

1 5 preferred. As with antisense regulation, the effect would likely apply to any other proteins 

within a similar family of genes exhibiting homology or substantial homology to the suppressing 
sequence. 

C.5. Transcriptional Silencing 

2 0 The nucleic acid sequences of the invention, including the SDFs of Table 1, and 

fragments thereof, contain sequences that can be inserted into the genome of an organism 
resulting in transcriptional silencing. Such regulatory sequences need not be operatively linked 
to coding sequences to modulate transcription of a gene. Specifically, a promoter sequence 
without any other element of a gene can be introduced into a genome to transcriptionally silence 

25 an endogenous gene (see, for example, Vaucheret, H et al. (1998) The Plant Joumal 16: 651- 
659). As another example, triple helices can be formed using oligonucleotides based on 
sequences from TABLE 1, fragments thereof, and substantially similar sequence thereto. The 
oligonucleotide can be delivered to the host cell and can bind to the promoter in the genome to 
form a triple helix and prevent transcription. An oligonucleotide of interest is one that can bind 

30 to the promoter and block binding of a transcription factor to the promoter. In such a case, the 
oligonucleotide can be complementary to the sequences of the promoter that interact with 
transcription binding factors. 
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C.6. Other Methods to Inhibit Gene Expression 

Yet another means of suppressing gene expression is to insert a polynucleotide into the 
gene of interest to disrupt transcription or translation of the gene. 

Low frequency homologous recombination can be used to target a polynucleotide insert 
5 to a gene by flanking the polynucleotide insert with sequences that are substantially similar to 
the gene to be disrupted. Sequences from TABLE 1, fragments thereof, and substantially similar 
sequence thereto can be used for homologous recombination. 

In addition, random insertion of polynucleotides into a host cell genome can also be used 
to disrupt the gene of interest. Azpiroz-Leehan et al.. Trends in Genetics 13:152 (1997). In this 
1 0 method, screening for clones from a library containing random insertions is preferred to 

identifying those that have polynucleotides inserted into the gene of interest. Such screening can 
r;j be performed using probes and/or primers described above based on sequences from TABLE 1, 

|f fragments thereof, and substantially similar sequence thereto. The screening can also be 

w;5 performed by selecting clones or Ri plants having a desired phenotype. 

1 5 LD. Methods of Functional Analysis 

. The constructs described in the methods under I.C. above can be used to determine 

% the function of the polypeptide encoded by the gene that is targeted by the constructs, 

nj Down-regulating the transcription and translation of the targeted gene in the host cell 

Q or organisms, such as a plant, may produce phenotypic changes as compared to a wild-type 

2 0 cell or organism. In addition, in vitro assays can be used to determine if any biological 
activity, such as calcium flux, DNA transcription, nucleotide incorporation, etc., are being 
modulated by the down-regulation of the targeted gene. 

Coordinated regulation of sets of genes, e.g., those contributing to a desired polygenic 
trait, is sometimes necessary to obtain a desired phenotype. SDFs of the invention 

2 5 representing transcription activation and DNA binding domains can be assembled into hybrid 

transcriptional activators. These hybrid transcriptional activators can be used with their 
corresponding DNA elements (i.e., those bound by the DNA-binding SDFs) to effect 
coordinated expression of desired genes (J.J. Schwarz et aL, Mol Cell Biol 12:266 (1992), 
A. Martinez et aL, Mol Gen. Genet 261:546 (1999)). 

3 0 The SDFs of the invention can also be used in the two-hybrid genetic systems to 

identify networks of protein-protein interactions (L. McAlister-Henn et aL, Methods 19:330 
(1999), J.C, Hu et aL, Methods 20:80 (2000), M. Golovkin et aL, /. Biol Chem. 274:36428 
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(1999), K. Ichimura et aL, Biochem, Biophys. Res, Comm. 253:532 (1998)), The SDFs of the 
invention can also be used in various expression display methods to identify important 
protein-DNA interactions (e.g. B. Luo et al., J. MoL Biol 266:479 (1997)). 

I.E. Promoters 

5 The SDFs of the invention are also useful as structural or regulatory sequences in a 

construct for modulating the expression of the corresponding gene in a plant or other organism, 
e.g. a symbiotic bacterium. For example, promoter sequences associated to SDFs of Table 1 of 
the present invention can be useful in directing expression of coding sequences either as 
constitutive promoters or to direct expression in particular cell types, tissues, or organs or in 
1 0 response to environmental stimuli. 

With respect to the SDFs of the present invention a promoter is likely to be a relatively 
small portion of a genomic DNA (gDNA) sequence located in the first 2000 nucleotides 
upstream from an initial exon identified in a gDNA sequence or initial ATG" or methionine 

1 5 codon or translational start site in a corresponding cDNA sequence. Such promoters are more 
likely to be found in the first 1000 nucleotides upstream of an initial ATG or methionine codon 
or translational start site of a cDNA sequence corresponding to a gDNA sequence. In particular, 
the promoter is usually located upstream of the transcription start site. The fragments of a 
particular gDNA sequence that function as elements of a promoter in a plant cell will preferably 

20 be found to hybridize to gDNA sequences presented and described in Table 1 at medium or high 
stringency, relevant to the length of the probe and its base composition. 

Promoters are generally modular in nature. Promoters can consist of a basal promoter 
that functions as a site for assembly of a transcription complex comprising an RNA polymerase, 
for example RNA polymerase 11. A typical transcription complex will include additional factors 

2 5 such as TFnB, TFuD, and TFnE. Of these, TFuD appears to be the only one to bind DNA 

directly. The promoter might also contain one or more enhancers and/or suppressors that 
function as binding sites for additional transcription factors that have the function of modulating 
the level of transcription with respect to tissue specificity and of transcriptional responses to 
particular environmental or nutritional factors, and the like. 

3 0 Short DNA sequences representing binding sites for proteins can be separated from each 

other by intervening sequences of varying length. For example, within a particular functional 
module, protein binding sites may be constituted by regions of 5 to 60, preferably 10 to 30, more 
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preferably 10 to 20 nucleotides. Within such binding sites, there are typically 2 to 6 nucleotides 
that specifically contact amino acids of the nucleic acid binding protein. The protein binding 
sites are usually separated from each other by 10 to several hundred nucleotides, typically by 15 
to 150 nucleotides, often by 20 to 50 nucleotides. DNA binding sites in promoter elements often 
display dyad symmetry in their sequence. Often elements binding several different proteins, 
and/or a plurality of sites that bind the same protein, will be combined in a region of 50 to 1,000 
basepairs. 

Elements that have transcription regulatory function can be isolated from their 
corresponding endogenous gene, or the desired sequence can be synthesized, and recombined in 
constructs to direct expression of a coding region of a gene in a desired tissue-specific, temporal- 
specific or other desired manner of inducibility or suppression. When hybridizations are 
performed to identify or isolate elements of a promoter by hybridization to the long sequences 
presented in TABLE 1, conditions are adjusted to account for the above-described nature of 
promoters. For example short probes, constituting the element sought, are preferably used under 
low temperature and/or high salt conditions. When long probes, which might include several 
promoter elements are used, low to medium stringency conditions are preferred when 
hybridizing to promoters across species. 

If a nucleotide sequence of an SDF, or part of the SDF, functions as a promoter or 
fragment of a promoter, then nucleotide substitutions, insertions or deletions that do not 
substantially affect the binding of relevant DNA binding proteins would be considered 
equivalent to the exemplified nucleotide sequence. It is envisioned that there are instances 
where it is desirable to decrease the binding of relevant DNA binding proteins to silence or 
down-regulate a promoter, or conversely to increase the binding of relevant DNA binding 
proteins to enhance or up-regulate a promoter and vice versa. In such instances, 
polynucleotides representing changes to the nucleotide sequence of the DNA-protein contact 
region by insertion of additional nucleotides, changes to identity of relevant nucleotides, 
including use of chemically-modified bases, or deletion of one or more nucleotides are 
considered encompassed by the present invention. In addition, fragments of the promoter 
sequences described by Table 1 and variants thereof can be fused with other promoters or 
fragments to facilitate transcription and/or transcription in specific type of cells or under 
specific conditions. 

Promoter function can be assayed by methods known in the art, preferably by 
measuring activity of a reporter gene operatively linked to the sequence being tested for 
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promoter function. Examples of reporter genes include those encoding luciferase, green 
fluorescent protein, GUS, neo, cat and bar. 

I.F. UTRs and Junctions 

Polynucleotides comprising untranslated (UTR) sequences and intron/exon junctions are 
also within the scope of the invention. UTR sequences include introns and 5' or 3' untranslated 
regions (5' UTRs or 3' UTRs). Fragments of the sequences shown in TABLE 1 can comprise 
UTRs and intron/exon junctions. 

These fragments of SDFs, especially UTRs, can have regulatory functions related to, for 
example, translation rate and mRNA stability. Thus, these fragments of SDFs can be isolated 
for use as elements of gene constructs for regulated production of polynucleotides encoding 
desired polypeptides. 

Introns of genomic DNA segments might also have regulatory functions. Sometimes 
regulatory elements, especially transcription enhancer or suppressor elements, are found 
within introns. Also, elements related to stability of heteronuclear RNA and efficiency of 
splicing and of transport to the cytoplasm for translation can be found in intron elements. 
Thus, these segments can also find use as elements of expression vectors intended for use to 
transform plants. 

Just as with promoters UTR sequences and intron/exon junctions can vary from those 
shown in TABLE 1. Such changes from those sequences preferably will not affect the 
regulatory activity of the UTRs or intron/exon junction sequences on expression, 
transcription, or translation unless selected to do so. However, in some instances, down- or 
up-regulation of such activity may be desired to modulate traits or phenotypic or in vitro 
activity. 

LG. Coding Sequences 

Isolated polynucleotides of the invention can include coding sequences that encode 
polypeptides comprising an amino acid sequence encoded by sequences in TABLE 1 or an 
amino acid sequence presented in TABLE 1. 

A nucleotide sequence encodes a polypeptide if a cell (or a cell free in vitro system) 
expressing that nucleotide sequence produces a polypeptide having the recited amino acid 
sequence when the nucleotide sequence is transcribed and the primary transcript is 
subsequently processed and translated by a host cell (or a cell free in vitro system) harboring 
the nucleic acid. Thus, an isolated nucleic acid that encodes a particular amino acid sequence 
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can be a genomic sequence comprising exons and introns or a cDNA sequence that represents 
the product of splicing thereof. An isolated nucleic acid encoding an amino acid sequence 
also encompasses heteronuclear RNA, which contains sequences that are spliced out during 
expression, and mRNA, which lacks those sequences. 
5 Coding sequences can be constructed using chemical synthesis techniques or by 

isolating coding sequences or by modifying such synthesized or isolated coding sequences as 
described above. 

In addition to coding sequences encoding the polypeptide sequences of TABLE 1, 
which are native to corn, Arabidopsis, soybean, rice, wheat, and other plants the isolated 

1 0 polynucleotides can be polynucleotides that encode variants, fragments, and fusions of those 
native proteins. Such polypeptides are described below in part IL 

In variant polynucleotides generally, the number of substitutions, deletions or insertions 
is preferably less than 20%, more preferably less than 15%; even more preferably less than 10%, 
5%, 3% or 1% of the number of nucleotides comprising a particularly exemplified sequence. It 

15 is generally expected that non-degenerate nucleotide sequence changes that result in 1 to 10, 
more preferably 1 to 5 and most preferably 1 to 3 amino acid insertions, deletions or 
substitutions will not greatly affect the function of an encoded polypeptide. The most preferred 
embodiments are those wherein 1 to 20, preferably 1 to 10, most preferably 1 to 5 nucleotides 
are added to, deleted from and/or substituted in the sequences specifically disclosed in TABLE 

20 1. 

Insertions or deletions in polynucleotides intended to be used for encoding a polypeptide 
preferably preserve the reading frame. This consideration is not so important in instances when 
the polynucleotide is intended to be used as a hybridization probe. 

IL Polypeptides and Proteins 
2 5 IIA. Native polypeptides and proteins 

Polypeptides within the scope of the invention include both native proteins as well as 
variants, fragments, and fusions thereof. Polypeptides of the invention are those encoded by 
any of the six reading frames of sequences shown in TABLE 1, preferably encoded by the 
30 three frames reading in the 5' to 3' direction of the sequences as shown. 

Native polypeptides include the proteins encoded by the sequences shown in TABLE 
1. Such native polypeptides include those encoded by allelic variants. 
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Polypeptide and protein variants will exhibit at least 75% sequence identity to those 
native polypeptides of TABLE 1. More preferably, the polypeptide variants will exhibit at least 
85% sequence identity; even more preferably, at least 90% sequence identity; more preferably at 
least 95%, 96%, 97%, 98%, or 99% sequence identity. Fragments of polypeptide or fragments 
of polypeptides will exhibit similar percentages of sequence identity to the relevant fragments 
of the native polypeptide. Fusions will exhibit a similar percentage of sequence identity in that 
fragment of the fusion represented by the variant of the native peptide. 

Furthermore, polypeptide variants will exhibit at least one of the functional properties of 
the native protein. Such properties include, without limitation, protein interaction, DNA 
interaction, biological activity, immunological activity, receptor binding, signal transduction, 
transcription activity, growth factor activity, secondary structure, three-dimensional structure, 
etc. As to properties related to in vitro or in vivo activities, the variants preferably exhibit at least 
60% of the activity of the native protein; more preferably at least 70%, even more preferably at 
least 80%, 85%, 90% or 95% of at least one activity of the native protein. 

One type of variant of native polypeptides comprises amino acid substitutions, deletions 
and/or insertions. Conservative substitutions are preferred to maintain the function or activity of 
the polypeptide. 

Within the scope of percentage of sequence identity described above, a polypeptide of 
the invention may have additional individual amino acids or amino acid sequences inserted into 
the polypeptide in the middle thereof and/or at the N-terminal and/or C-terminal ends thereof. 
Likewise, some of the amino acids or amino acid sequences may be deleted from the 
polypeptide. 

A.l Antibodies 

Isolated polypeptides can be utilized to produce antibodies. Polypeptides of the invention 
can generally be used, for example, as antigens for raising antibodies by known techniques. The 
resulting antibodies are useful as reagents for determining the distribution of the antigen protein 
within the tissues of a plant or within a cell of a plant. The antibodies are also useful for 
examining the production level of proteins in various tissues, for example in a wild-type plant or 
following genetic manipulation of a plant, by methods such as Westem blotting. 

Antibodies of the present invention, both polyclonal and monoclonal, may be prepared 
by conventional methods. In general, the polypeptides of the invention are first used to 
immunize a suitable animal, such as a mouse, rat, rabbit, or goat. Rabbits and goats are 
preferred for the preparation of polyclonal sera due to the volume of serum obtainable, and the 
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availability of labeled anti-rabbit and anti-goat antibodies as detection reagents. Immunization is 
generally performed by mixing or emulsifying the protein in saline, preferably in an adjuvant 
such as Freund's complete adjuvant, and injecting the mixture or emulsion parenterally 
(generally subcutaneously or intramuscularly). A dose of 50-200 |LLg/injection is typically 
sufficient. Immunization is generally boosted 2-6 weeks later with one or more injections of the 
protein in saline, preferably using Freund's incomplete adjuvant. One may altematively 
generate antibodies by in vitro immunization using methods known in the art, which for the 
purposes of this invention is considered equivalent to in vivo immunization. 

Polyclonal antisera is obtained by bleeding the immunized animal into a glass or plastic 
container, incubating the blood at 25^C for one hour, followed by incubating the blood at 4°C 
for 2-18 hours. The serum is recovered by centrifugation (e.g., l,000xg for 10 minutes). About 
20-50 ml per bleed may be obtained from rabbits. 

Monoclonal antibodies are prepared using the method of Kohler and Milstein, Nature 
256: 495 (1975), or modification thereof. Typically, a mouse or rat is immunized as described 
above. However, rather than bleeding the animal to extract serum, the spleen (and optionally 
several large lymph nodes) is removed and dissociated into single cells. If desired, the spleen 
cells can be screened (after removal of nonspecifically adherent cells) by applying a cell 
suspension to a plate, or well, coated with the protein antigen. B-cells producing membrane- 
bound immunoglobulin specific for the antigen bind to the plate, and are not rinsed away with 
the rest of the suspension. Resulting B-cells, or all dissociated spleen cells, are then induced to 
fuse with myeloma cells to form hybridomas, and are cultured in a selective medium (e.g., 
hypoxanthine, aminopterin, thymidine medium, HAT"). The resulting hybridomas are plated 
by limiting dilution, and are assayed for the production of antibodies which bind specifically to 
the immunizing antigen (and which do not bind to unrelated antigens). The selected Mab- 
secreting hybridomas are then cultured either in vitro {e.g., in tissue culture bottles or hollow 
fiber reactors), or in vivo (as ascites in mice). 

Other methods for sustaining antibody-producing B-cell clones, such as by EBV 
transformation, are known. 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive atoms 
(particularly ^^P and ^^I), electron-dense reagents, enzymes, and ligands having specific binding 
partners. Enzymes are typically detected by their activity. For example, horseradish peroxidase 
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is usually detected by its ability to convert 3,3',5,5'-tetramethylbenzidine (TNB) to a blue 
pigment, quantifiable with a spectrophotometer. 

A.2 In Vitro Applications of Polypeptides 

Some polypeptides of the invention will have enzymatic activities that are useful in vitro. 
For example, the soybean trypsin inhibitor (Kunitz) family is one of the numerous families of 
proteinase inhibitors. It comprises plant proteins which have inhibitory activity against serine 
proteinases from the trypsin and subtilisin families, thiol proteinases and aspartic proteinases. 
Thus, these peptides find in vitro use in protein purification protocols and perhaps in 
therapeutic settings requiring topical application of protease inhibitors. 

Delta-aminolevulinic acid dehydratase (EC 4.2. L24 ) (ALAD) catalyzes the second 
step in the biosynthesis of heme, the condensation of two molecules of 5-aminolevulinate to 
form porphobilinogen and is also involved in chlorophyll biosynthesis(Kaczor et al. (1994) 
Plant Physiol. 1-4: 1411-7; Smith (1988) Biochem. J. 249: 423-8; Schneider (1976) Z. 
naturforsch. [C] 31: 55-63). Thus, ALAD proteins can be used as catalysts in synthesis of 
heme derivatives. Enzymes of biosynthetic pathways generally can be used as catalysts for in 
vitro synthesis of the compounds representing products of the pathway. 

Polypeptides encoded by SDFs of the invention can be engineered to provide 
purification reagents to identify and purify additional polypeptides that bind to them. This 
allows one to identify proteins that function as multimers or elucidate signal transduction or 
metabolic pathways. In the case of DNA binding proteins, the polypeptide can be used in a 
similar manner to identify the DNA determinants of specific binding (S. Pierrou et aL, A«a/. 
Biochem. 229:99 (1995), S. Chusacultanachai et al., J. Biol Chem, 274:23591 (1999), Q, Lin 
et al.,/, Biol Chem, 272:27274 (1997)). 

ILB . POLYPEPTIDE VARIANTS , FRAGMENTS, AND FUSIONS 
Generally, variants , fragments, or fusions of the polypeptides encoded by the SDFs of 

the invention can exhibit at least one of the activities of the identified domains and/or related 

polypeptides described in Table 1 corresponding to the SDF of interest. 

ILB .(1) Variants 

A type of variant of the native polypeptides comprises amino acid substitutions. 
Conservative substitutions, described above (see IL), are preferred to maintain the function or 
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activity of the polypeptide. Such substitutions include conservation of charge, polarity, 
hydrophobicity, size, etc. For example, one or more amino acid residues within the sequence 
can be substituted with another amino acid of similar polarity that acts as a functional 
equivalent, for example providing a hydrogen bond in an enzymatic catalysis. Substitutes for an 
amino acid within an exemplified sequence are preferably made among the members of the class 
to which the amino acid belongs. For example, the nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. The 
polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids include arginine, lysine and histidine. 
The negatively charged (acidic) amino acids include aspartic acid and glutamic acid. 

Within the scope of percentage of sequence identity described above, a polypeptide of 
the invention may have additional individual amino acids or amino acid sequences inserted into 
the polypeptide in the middle thereof and/or at the N-terminal and/or C-terminal ends thereof. 
Likewise, some of the amino acids or amino acid sequences may be deleted from the 
polypeptide. Amino acid substitutions may also be made in the sequences; conservative 
substitutions being preferred. 

One preferred class of variants are those that comprise (1) the domain of an 
encoded polypeptide and/or (2) residues conserved between the encoded polypeptide and 
related polypeptides. For this class of variants, the encoded polypeptide sequence is changed 
by insertion, deletion, or substitution at positions flanking the domain and/or conserved 
residues. 

Another class of variants includes those that comprise an encoded polypeptide 
sequence that is changed in the domain or conserved residues by a conservative substitution. 

Yet another class of variants includes those that lack one of the in vitro 
activities, or structural features of the encoded polypeptides. One example is polypeptides or 
proteins produced from genes comprising dominant negative mutations. Such a variant may 
comprise an encoded polypeptide sequence with non-conservative changes in a particular 
domain or group of conserved residues. 

ILA.(2) FIIAGMENTS 

Fragments of particular interest are those that comprise a domain identified for a 
polypeptide encoded by an SDF of the instant invention and variants thereof. Also, fragments 
that comprise at least one region of residues conserved between an SDF encoded polypeptide 
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and its related polypeptides are of great interest. Fragments are sometimes useful as 
polypeptides corresponding to genes comprising dominant negative mutations are. 

II.A,(3)FUSI0NS 

Of interest are chimeras comprising (1) a fragment of the SDF encoded 
polypeptide or variants thereof of interest and (2) a fragment of a polypeptide comprising the 
same domain. For example, an AP2 helix encoded by a SDF of the invention fused to second 
AP2 helix from ANT protein, which comprises two AP2 helices. The present invention also 
encompasses fusions of SDF encoded polypeptides, variants, or fragments thereof fused with 
related proteins or fragments thereof. 
DEFINITION OF DOMAINS 

The polypeptides of the invention may possess identifying domains. In addition, the 
domains within the SDF encoded polypeptide can be defined by the region that exhibits at 
least 70% sequence identity with the consensus sequences listed in the detailed description 
below of each of the domains. 

The majority of the protein domain descriptions given in the protein domain table are 
obtained from Prosite, (http//www,expasy.ch/prosite/), and Pfam, 
(http//pfam.wusd.edu/browse.shtml). 

A. Activities of Polypeptides Comprising Signal Peptides 

Polypeptides comprising signal peptides are a family of proteins that are typically 
targeted to (1) a particular organelle or intracellular compartment, (2) interact with a 
particular molecule or (3) for secretion outside of a host cell. Example of polypeptides 
comprising signal peptides include, without limitation, secreted proteins, soluble proteins, 
receptors, proteins retained in the ER, etc. 

These proteins comprising signal peptides are useful to modulate ligand-receptor 
interactions, cell-to-cell communication, signal transduction, intracellular communication, 
and activities and/or chemical cascades that take part in an organism outside or within of any 
particular cell. 

One class of such proteins are soluble proteins which are transported out of the cell. 
These proteins can act as ligands that bind to receptor to trigger signal transduction or to 
permit communication between cells. 
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Another class is receptor proteins which also comprise a retention domain that lodges 
the receptor protein in the membrane when the cell transports the receptor to the surface of 
the cell. Like the soluble ligands, receptors can also modulate signal transduction and 
communication between cells. 

In addition the signal peptide itself can serve as a ligand for some receptors. An 
example is the interaction of the ER targeting signal peptide with the signal recognition 
particle (SRP). Here, the SRP binds to the signal peptide, halting translation, and the 
resulting SRP complex then binds to docking proteins located on the surface of the ER, 
prompting transfer of the protein into the ER. 



A description of signal peptide residue composition is described below in Subsection 

IV.C.l. 
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III. Methods of Modulating Polypeptide Production 

It is contemplated that polynucleotides of the invention can be incorporated into a 
host cell or in-vitro system to modulate polypeptide production. For instance, the SDFs 
prepared as described herein can be used to prepare expression cassettes useful in a number of 
techniques for suppressing or enhancing expression. 
5 An example are polynucleotides comprising sequences to be transcribed, such as 

coding sequences, of the present invention can be inserted into nucleic acid constructs to 
modulate polypeptide production. Typically, such sequences to be transcribed are 
heterologous to at least one element of the nucleic acid construct to generate a chimeric gene 
or construct. 

1 0 Another example of useful polynucleotides are nucleic acid molecules comprising 

regulatory sequences of the present invention. Chimeric genes or constructs can be generated 
fj when the regulatory sequences of the invention linked to heterologous sequences in a vector 

J;f construct. Within the scope of invention are such chimeric gene and/or constructs. 

y;]i Also within the scope of the invention are nucleic acid molecules, whereof at least a part 

y\ 15 or fragment of these DNA molecules are presented in TABLE 1 of the present application, and 
; wherein the coding sequence is under the control of its own promoter and/or its own regulatory 
. elements. Such molecules are useful for transforming the genome of a host cell or an organism 

regenerated from said host cell for modulating polypeptide production, 
[y Additionally, a vector capable of producing the oligonucleotide can be inserted into the 

ri 2 0 host cell to deliver the oligonucleotide. 

More detailed description of components to be included in vector constructs are 

described both above and below. 

Whether the chimeric vectors or native nucleic acids are utilized, such 

polynucleotides can be incorporated into a host cell to modulate polypeptide production. 

2 5 Native genes and/or nucleic acid molecules can be effective when exogenous to the host cell. 

Methods of modulating polypeptide expression includes, without limitation: 
Suppression methods, such as 

Antisense 

Ribozymes 

3 0 Co-suppression 

Insertion of Sequences into the Gene to be Modulated 
Regulatory Sequence Modulation. 
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as well as Methods for Enhancing Production, such as 
Insertion of Exogenous Sequences; and 
Regulatory Sequence Modulation. 

III.A. Suppression 

Expression cassettes of the invention can be used to suppress expression of 
endogenous genes which comprise the SDF sequence. Inhibiting expression can be useful, 
for instance, to tailor the ripening characteristics of a fruit (Oeller et aL, Science 254:437 
(1991)) or to influence seed size_(WO98/07842) or to provoke cell ablation (Mariani et al.. 
Nature 357: 384-387 (1992). 

As described above, a number of methods can be used to inhibit gene expression in 
plants, such as antisense, ribozyme, introduction of exogenous genes into a host cell, 
insertion of a polynucleotide sequence into the coding sequence and/or the promoter of the 
endogenous gene of interest, and the like. 

Ill .A. 1 . Antisense 

An expression cassette as described above can be transformed into host cell or 
plant to produce an antisense strand of RNA. For plant cells, antisense RNA inhibits gene 
expression by preventing the accumulation of mRNA which encodes the enzyme of interest, see, 
e.g., Sheehy et al., Proa Nat Acad. Set USA, 85:8805 (1988), and Hiatt et al., U.S. Patent No. 
4,801,340. 

in.A,2. Ribozymes 

Similarly, ribozyme constructs can be transformed into a plant to cleave mRNA 
and down-regulate translation. 

III.A.3. Co-Suppression 

Another method of suppression is by introducing an exogenous copy of the gene 
to be suppressed. Introduction of expression cassettes in which a nucleic acid is configured in 
the sense orientation with respect to the promoter has been shown to prevent the accumulation of 
mRNA. A detailed description of this method is described above. 



III. .A.. 4. 



Insertion of Sequences into the Gene to be Modulated 
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Yet another means of suppressing gene expression is to insert a polynucleotide 
into the gene of interest to disrupt transcription or translation of the gene. 

Homologous recombination could be used to target a polynucleotide insert to a 
gene using the Cre-Lox system (A.C, Vergunst et al., Nucleic Acids Res. 26:2729 (1998), A.C. 
5 Vergunst et al., Plant MoL Biol 38:393 (1998), H. Albert et aL, Plant J, 7:649 (1995)). 

In addition, random insertion of polynucleotides into a host cell genome can also 
be used to disrupt the gene of interest. Azpiroz-Leehan et aL, Trends in Genetics 13:152 (1997). 
In this method, screening for clones from a library containing random insertions is preferred for 
identifying those that have polynucleotides inserted into the gene of interest. Such screening can 
10 be performed using probes and/or primers described above based on sequences from TABLE 1, 
fragments thereof, and substantially similar sequence thereto. The screening can also be 
performed by selecting clones or any transgenic plants having a desired phenotype. 

IILA.5. Regulatory SequenceModulation 

The SDFs described in Table 1, and fragments thereof are examples of 
1 5 nucleotides of the invention that contain regulatory sequences that can be used to suppress or 
inactivate transcription and/or translation from a gene of interest as discussed in LC.5. 



III.A.6. Genes Comprising Dominant-Negative Mutations 

When suppression of production of the endogenous, native protein is desired it 
2 0 is often helpful to express a gene comprising a dominant negative mutation. Production of 
protein variants produced from genes comprising dominant negative mutations is a useful 
tool for research Genes comprising dominant negative mutations can produce a variant 
polypeptide which is capable of competing with the native polypeptide, but which does not 
produce the native result. Consequently, over expression of genes comprising these mutations 
2 5 can titrate out an undesired activity of the native protein. For example. The product from a 
gene comprising a dominant negative mutation of a receptor can be used to constitutively 
activate or suppress a signal transduction cascade, allowing examination of the phenotype 
and thus the trait (s) controlled by that receptor and pathway. Alternatively, the protein arising 
from the gene comprising a dominant-negative mutation can be an inactive enzyme still capable 
30 of binding to the same substrate as the native protein and therefore competes with such native 
protein. 
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Products from genes comprising dominant-negative mutations can also act upon 
the native protein itself to prevent activity. For example, the native protein may be active only 
as a homo-multimer or as one subunit of a hetero-multimer. Incorporation of an inactive subunit 
into the multimer with native subunit(s) can inhibit activity. 
5 Thus, gene function can be modulated in host cells of interest by insertion into 

these cells vector constructs comprising a gene comprising a dominant-negative mutation. 

IILB, Enhanced Expression 

Enhanced expression of a gene of interest in a host cell can be accomplished by either 
(1) insertion of an exogenous gene; or (2) promoter modulation. 

10 III.B.l. Insertion of an Exogenous Gene 

Insertion of an expression construct encoding an exogenous gene can boost the 
number of gene copies expressed in a host cell. 

Such expression constructs can comprise genes that either encode the native 
protein that is of interest or that encode a variant that exhibits enhanced activity as compared to 
1 5 the native protein. Such genes encoding proteins of interest can be constructed from the 
sequences from TABLE 1, fragments thereof, and substantially similar sequence thereto. 

Such an exogenous gene can include either a constitutive promoter permitting 
expression in any cell in a host organism or a promoter that directs transcription only in 
particular cells or times during a host cell life cycle or in response to environmental stimuli. 

2 0 III.B.2. Regulatory Sequence Modulation 

The SDFs of Table 1, and fragments thereof, contain regulatory sequences that 
can be used to enhance expression of a gene of interest. For example, some of these sequences 
contain useful enhancer elements. In some cases, duplication of enhancer elements or insertion 
of exogenous enhancer elements will increase expression of a desired gene from a particular 

2 5 promoter. As other examples, all 11 promoters require binding of a regulatory protein to be 

activated, while some promoters may need a protein that signals a promoter binding protein to 
expose a polymerase binding site. In either case, over-production of such proteins can be used 
to enhance expression of a gene of interest by increasing the activation time of the promoter. 

Such regulatory proteins are encoded by some of the sequences in TABLE 1, 

3 0 fragments thereof, and substantially similar sequences thereto. 

Coding sequences for these proteins can be constructed as described above. 
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IV. Gene Constructs and Vector Construction 

To use isolated SDFs of the present invention or a combination of them or parts and/or 
mutants and/or fusions of said SDFs in the above techniques, recombinant DNA vectors which 
5 comprise said SDFs and are suitable for transformation of cells, such as plant cells, are usually 
prepared. The SDF construct can be made using standard recombinant DNA techniques 
(Sambrook et al. 1989) and can be introduced to the species of interest hy Agrobacterium- 
mediated transformation or by other means of transformation {e.g., particle gun 
bombardment) as referenced below. 
1 0 The vector backbone can be any of those typical in the art such as plasmids, viruses, 

artificial chromosomes, BACs, YACs and PACs and vectors of the sort described by 

(a) BAC: Shizuya et al., Proc. NatL Acad. Sci. USA 89: 8794-8797 (1992); 
Hamilton et al., Proc. Natl. Acad. Sci. USA 93: 9975-9979 (1996); 

(b) YAC: Burke et al., Science 236:806-812 (1987);. 

1 5 (c) PAC: Sternberg N. et al,, Proc Natl Acad Sci USA. Jan;87(l): 103-7 (1990); 

(d) Bacteria- Yeast Shuttle Vectors: Bradshaw et al., Nucl Acids Res 23: 4850- 
4856 (1995); 

(e) Lambda Phage Vectors: Replacement Vector, e.g., 
Frischauf et al., J. Mol Biol 170: 827-842 (1983); or Insertion vector, e.g., 

2 0 Huynh et al., In: Glover NM (ed) DNA Cloning: A practical Approach, Vol.1 Oxford: IRL 
Press (1985); 

(f) T-DNA gene fusion vectors :Walden et al., Mol Cell Biol 1: 175-194 (1990); 
and 

(g) Plasmid vectors: Sambrook et al., infra. 

2 5 Typically, a vector will comprise the exogenous gene, which in its tum comprises an 

SDF of the present invention to be introduced into the genome of a host cell, and which gene 
may be an antisense construct, a ribozyme constmct chimeraplast, or a coding sequence with 
any desired transcriptional and/or translational regulatory sequences, such as promoters, UTRs, 
and y end termination sequences. Vectors of the invention can also include origins of 

3 0 replication, scaffold attachment regions (SARs), markers, homologous sequences, introns, etc. 

A DNA sequence coding for the desired polypeptide, for example a cDNA sequence 
encoding a full length protein, will preferably be combined with transcriptional and translational 
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initiation regulatory sequences which will direct the transcription of the sequence from the gene 
in the intended tissues of the transformed plant* 

For example, for over-expression, a plant promoter fragment may be employed that will 
direct transcription of the gene in all tissues of a regenerated plant. Altematively, the plant 
5 promoter may direct transcription of an SDF of the invention in a specific tissue (tissue-specific 
promoters) or may be otherwise under more precise environmental control (inducible 
promoters). 

If proper polypeptide productionis desired, a polyadenylation region at the 3 -end of the 
coding region is typically included. The polyadenylation region can be derived from the natural 
1 0 gene, from a variety of other plant genes, or from T-DNA. 

The vector comprising the sequences from genes or SDF or the invention may 
comprise a marker gene that confers a selectable phenotype on plant cells. The vector can 
include promoter and coding sequence, for instance. For example, the marker may encode 
biocide resistance, particularly antibiotic resistance, such as resistance to kanamycin, G418, 
1 5 bleomycin, hygromycin, or herbicide resistance, such as resistance to chlorosulfuron or 
phosphinotricin. 

IV.A. Coding Sequences 

Generally, the sequence in the transformation vector and to be introduced into 
the genome of the host cell does not need to be absolutely identical to an SDF of the present 

2 0 invention. Also, it is not necessary for it to be full length, relative to either the primary 

transcription product or fully processed mRNA. Furthermore, the introduced sequence need not 
have the same intron or exon pattem as a native gene. Also, heterologous non-coding segments 
can be incorporated into the coding sequence without changing the desired amino acid sequence 
of the polypeptide to be produced. 

25 IV.B. Promoters 

As explained above, introducing an exogenous SDF from the same species or an 
orthologous SDF from another species can modulate the expression of a native gene 
corresponding to that SDF of interest. Such an SDF construct can be under the control of 
either a constitutive promoter or a highly regulated inducible promoter (e.g., a copper 

3 0 inducible promoter). The promoter of interest can initially be either endogenous or 

heterologous to the species in question. When re-introduced into the genome of said species, 
such promoter becomes exogenous to said species. Over-expression of an SDF transgene can 
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lead to co-suppression of the homologous endogeneous sequence thereby creating some 
alterations in the phenotypes of the transformed species as demonstrated by similar analysis 
of the chalcone synthase gene (Napoli et al., Plant Cell 2:279 (1990) and van der Rrol et al., 
Plant Cell 2:291 (1990)). If an SDF is found to encode a protein with desirable 
5 characteristics, its over-production can be controlled so that its accumulation can be 
manipulated in an organ- or tissue-specific manner utilizing a promoter having such 
specificity. 

Likewise, if the promoter of an SDF (or an SDF that includes a promoter) is found to 
be tissue-specific or developmentally regulated, such a promoter can be utilized to drive or 
1 0 facilitate the transcription of a specific gene of interest (e.g., seed storage protein or root- 
specific protein). Thus, the level of accumulation of a particular protein can be manipulated 
or its spatial localization in an organ- or tissue- specific manner can be altered. 

IV. C Signal Peptides 

15 SDFs of the present invention containing signal peptides are indicated in Table 1. In 

some cases it may be desirable for the protein encoded by an introduced exogenous or 
orthologous SDF to be targeted (1) to a particular organelle intracellular compartment, (2) to 
interact with a particular molecule such as a membrane molecule or (3) for secretion outside 
of the cell harboring the introduced SDF. This will be accomplished using a signal peptide. 

2 0 Signal peptides direct protein targeting, are involved in ligand-receptor interactions 

and act in cell to cell communication. Many proteins, especially soluble proteins, contain a 
signal peptide that targets the protein to one of several different intracellular compartments. 
In plants, these compartments include, but are not limited to, the endoplasmic reticulum (ER), 
mitochondria, plastids (such as chloroplasts), the vacuole, the Golgi apparatus, protein 

2 5 storage vessicles (PSV) and, in general, membranes. Some signal peptide sequences are 

conserved, such as the Asn-Pro-Ile-Arg amino acid motif found in the N-terminal propeptide 
signal that targets proteins to the vacuole (Marty (1999) The Plant Cell 11: 587-599). Other 
signal peptides do not have a consensus sequence per 5e, but are largely composed of 
hydrophobic amino acids, such as those signal peptides targeting proteins to the ER (Vitale 

3 0 and Denecke (1999) The Plant Cell 11: 615-628). Still others do not appear to contain either 

a consensus sequence or an identified common secondary sequence, for instance the 
chloroplast stromal targeting signal peptides (Keegstra and Cline (1999) The Plant Cell 11: 
557-570). Furthermore, some targeting peptides are bipartite, directing proteins first to an 
organelle and then to a membrane within the organelle (e.g. within the thylakoid lumen of the 
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chloroplast; see Keegstra and Cline (1999) The Plant Cell 11: 557-570). In addition to the 
diversity in sequence and secondary structure, placement of the signal peptide is also varied. 
Proteins destined for the vacuole, for example, have targeting signal peptides found at the N- 
terminus, at the C-terminus and at a surface location in mature, folded proteins. Signal 
5 peptides also serve as ligands for some receptors. 

These characteristics of signal proteins can be used to more tighdy control the 
phenotypic expression of introduced SDFs, In particular, associating the appropriate signal 
sequence with a specific SDF can allow sequestering of the protein in specific organelles 
(plastids, as an example), secretion outside of the cell, targeting interaction with particular 
1 0 receptors, etc. Hence, the inclusion of signal proteins in constructs involving the SDFs of the 
invention increases the range of manipulation of SDF phenotypic expression. The nucleotide 
sequence of the signal peptide can be isolated from characterized genes using common 
molecular biological techniques or can be synthesized in vitro. 

In addition, the native signal peptide sequences, both amino acid and nucleotide, 
described in Table 1 can be used to modulate polypeptide transport. Further variants of the 
native signal peptides described in Table 1 are contemplated. Insertions, deletions, or 
substitutions can be made. Such variants will retain at least one of the functions of the native 
signal peptide as well as exhibiting some degree of sequence identity to the native sequence. 

Also, fragments of the signal peptides of the invention are useful and can be fused with 
other signal peptides of interest to modulate transport of a polypeptide. 

V, Transformation Techniques 

15 A wide range of techniques for inserting exogenous polynucleotides are known for a 

number of host cells, including, without limitation, bacterial, yeast, mammalian, insect and plant 
cells. 

Techniques for transforming a wide variety of higher plant species are well known and 
described in the technical and scientific literature. See, e.g. Weising et ah, Ann. Rev. Genet. 
2 0 22:421 (1988); and Christou, Euphytica, v. 85, n,l-3:13-27, (1995). 

DNA constructs of the invention may be introduced into the genome of the desired plant 
host by a variety of conventional techniques. For example, the DNA construct may be 
introduced directly into the genomic DNA of the plant cell using techniques such as 
electroporation and microinjection of plant cell protoplasts, or the DNA constructs can be 
2 5 introduced directly to plant tissue using ballistic methods, such as DNA particle bombardment. 
Alternatively, the DNA constructs may be combined with suitable T-DNA flanking regions and 
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introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions 
of tht Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent 
marker into the plant cell DNA when the cell is infected by the bacteria (McCormac et al., Mol 
Biotechnol 8:199 (1997); Hamilton, Gene 200:107 (1997)); Salomon et al. EMBOJ, 3:141 
5 (1984); Herrera-Estrella et al. EMBO J. 2:987 (1983). 

Microinjection techniques are known in the art and well described in the scientific and 
patent literature. The introduction of DNA constructs using polyethylene glycol precipitation is 
described in Paszkowski et al. EMBOJ, 3:2717 (1984). Electroporation techniques are 
described in Fromm et al. Proc, Natl Acad ScL USA 82:5824 (1985). Ballistic transformation 

1 0 techniques are described in Klein et al. Nature y2n:.112> (1987). Agrobacterium 

tumefaciens-m^dmXed transformation techniques, including disarming and use of binary or co- 
integrate vectors, are well described in the scientific literature. See, for example Hamilton, CM, 
Gene 200:107 (1997); Muller et al. Mol Gen. Genet 207:171 (1987); Komari et al. Plant J. 
10:165 (1996); Venkateswarlu et aL Biotechnology 9:1103 (1991) and Glcavc, AP., Plant MoL 

1 5 BioL 20:1203 (1992); Graves and Goldman, Plant Mol. Biol 1-34 (1986) and Gould et al.. Plant 
Physiology 95:426 (1991). 

Transformed plant cells which are derived by any of the above transformation 
techniques can be cultured to regenerate a whole plant that possesses the transformed genotype 
and thus the desired phenotype such as seedlessness. Such regeneration techniques rely on 

2 0 manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a 
biocide and/or herbicide marker which has been introduced together with the desired nucleotide 
sequences. Plant regeneration from cultured protoplasts is described in Evans et al.. Protoplasts 
Isolation and Culture in Handbook of Plant Cell Culture," pp. 124-176, MacMillan Publishing 
Company, New York, 1983; and Binding, Regeneration of Plants, Plant Protoplasts, pp. 21-73, 

2 5 CRC Press, Boca Raton, 1988. Regeneration can also be obtained from plant callus, explants, 

organs, or parts thereof. Such regeneration techniques are described generally in Klee et al. Ann. 
Rev. of Plant Phys. 38:467 (1987), Regeneration of monocots (rice) is described by Hosoyama 
et al. (Biosci. Biotechnol Biochem. 58:1500 (1994)) and by Ghosh et al. (J. Biotechnol 32:1 
(1994)). The nucleic acids of the invention can be used to confer desired traits on essentially any 

3 0 plant. 

Thus, the invention has use over a broad range of plants, including species from the 
gentm Anacardium, Arachis, Asparagus, Atropa, Avena, Brassica, Citrus, Citrullus, Capsicum, 
Carthamus, Cocos, Coffea, Cucumis, Cucurbita, Daucus, Elaeis, Fragaria, Glycine, Gossypium, 
Helianthus, Heterocallis, Hordeum, Hyoscyamus, Lactuca, Linum, Lolium,Lupinus, 
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Lycopersicon^ MaluSy Manihot, MajoranUy Medicago, Nicotiana, Olea, Oryza, Panieum, 
Pannesetum, Persea, Phaseolus, Pistachia, Pisum, Pyrus, Prunus, Raphanus, Ricinus, Secale, 
SeneciOy Sinapis, Solarium, Sorghum, Theobromus, Trigonella, Triticum, Vicia, Vitis, Vigna, 
and, Zea, 

5 One of skill will recognize that after the expression cassette is stably incorporated in 

transgenic plants and confirmed to be operable, it can be introduced into other plants by 
sexual crossing. Any of a number of standard breeding techniques can be used, depending 
upon the species to be crossed. 

The particular sequences of SDFs identified are provided in the attached TABLE 1. 
1 0 One of ordinary skill in the art, having this data, can obtain cloned DNA fragments, synthetic 
DNA fragments or polypeptides constituting desired sequences by recombinant methodology 
known in the art or described herein. 

EXAMPLES 

The invention is illustrated by way of the following examples. The invention is not 
1 5 limited by these examples as the scope of the invention is defined solely by the claims 
following. 

EXAMPLE 1: cDNA PREPARATION 

A number of the nucleotide sequences disclosed in TABLE 1 herein as representative of 
the SDFs of the invention can be obtained by sequencing genomic DNA (gDNA) and/or cDNA 

2 0 from com plants grown from HYBRID SEED # 35A19, purchased from Pioneer Hi-Bred 

International, Inc., Supply Management, P.O. Box 256, Johnston, Iowa 50131-0256, 

A number of the nucleotide sequences disclosed in TABLE 1 herein as representative 
of the SDFs of the invention can also be obtained by sequencing genomic DNA from 
Arabidopsis thaliana, Wassilewskija ecotype or by sequencing cDNA obtained from mRNA 
25 from such plants as described below. This is a true breeding strain. Seeds of the plant are 
available from the Arabidopsis Biological Resource Center at the Ohio State University, 
under the accession number CS2360. Seeds of this plant were deposited under the terms and 
conditions of the Budapest Treaty at the American Type Culture Collection, Manassas, VA 
on August 31, 1999, and were assigned ATCC No. PTA-595. 

3 0 Other methods for cloning full-length cDNA are described, for example, by Seki et 

al., Plant Journal 15:707-720 (1998) High-efficiency cloning of Arabidopsis full-length 
cDNA by biotinylated Cap trapper"; Maruyama et al., Gene 138:171 (1994) Oligo-capping a 
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simple method to replace the cap structure of eukaryotic mRNAs with oligoribonucleotides"; 
and WO 96/34981. 

Tissues were, or each organ was, individually pulverized and frozen in liquid 
nitrogen. Next, the samples were homogenized in the presence of detergents and then 
5 centrifuged. The debris and nuclei were removed from the sample and more detergents were 
added to the sample. The sample was centrifuged and the debris was removed. Then the 
sample was applied to a 2M sucrose cushion to isolate polysomes. The RNA was isolated by 
treatment with detergents and proteinase K followed by ethanol precipitation and 
centrifugation. The polysomal RNA from the different tissues was pooled according to the 
10 following mass ratios: 15/15/1 for male inflorescences, female inflorescences and root, 
respectively. The pooled material was then used for cDNA synthesis by the methods 
described below. 

f-ii Starting material for cDNA synthesis for the exemplary com cDNA clones 

with sequences presented in TABLE 1 was poly(A)-containing polysomal mRNAs from 
iCi 1 5 inflorescences and root tissues of corn plants grown from HYBRID SEED # 35A19. Male 

1 inflorescences and female (pre-and post-fertilization) inflorescences were isolated at various 

f'^ stages of development. Selection for poly(A) containing polysomal RNA was done using 

:= oligo d(T) cellulose columns, as described by Cox and Goldberg, Plant Molecular Biology: 

A Practical Approach", pp. 1-35, Shaw ed., c. 1988 by IRL, Oxford. The quality and the 
ni 2 0 integrity of the polyA+ RNAs were evaluated. 



Starting material for cDNA synthesis for the txG^mpldiry Arabidopsis cDNA 
clones with sequences presented in TABLE 1 was polysomal RNA isolated from the top- 
most inflorescence tissues of Arahidopsis thaliana Wassilewskija (Ws.) and from roots of 

2 5 Arabidopsis thaliana Landsberg erecta (L. er,), also obtained from the Arabidopsis 

Biological Resource Center. Nine parts inflorescence to every part root was used, as 
measured by wet mass. Tissue was pulverized and exposed to liquid nitrogen. Next, the 
sample was homogenized in the presence of detergents and then centrifuged. The debris and 
nuclei were removed from the sample and more detergents were added to the sample. The 

3 0 sample was centrifuged and the debris was removed and the sample was applied to a 2M 

sucrose cushion to isolate polysomal RNA. Cox et al.. Plant Molecular Biology: A Practical 
Approach", pp. 1-35, Shaw ed., c. 1988 by IRL, Oxford. The polysomal RNA was used 
for cDNA synthesis by the methods described below. Polysomal mRNA was then isolated as 
described above for com cDNA. The quality of the RNA was assessed electrophoretically. 
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Following preparation of the mRNAs from various tissues as described above, selection 
of mRNA with intact 5' ends and specific attachment of an oligonucleotide tag to the 5' end of 
such mRNA was performed using either a chemical or enzymatic approach. Both techniques 
take advantage of the presence of the cap" structure, which characterizes the 5 ' end of most 
5 intact mRNAs and which comprises a guanosine generally methylated once, at the 7 position. 

The chemical modification approach involves the optional elimination of the 2\ 3'-cis 
diol of the 3' terminal ribose, the oxidation of the 2\ 3'-cis diol of the ribose linked to the cap of 
the 5 ' ends of the mRNAs into a dialdehyde, and the coupling of the such obtained dialdehyde to 
a derivatized oligonucleotide tag. Further detail regarding the chemical approaches for 
1 0 obtaining mRNAs having intact 5' ends are disclosed in Intemational Application No. 
W096/34981 published November 7, 1996. 

The enzymatic approach for ligating the oligonucleotide tag to the intact 5' ends of 
mRNAs involves the removal of the phosphate groups present on the 5' ends of uncapped 
incomplete mRNAs, the subsequent decapping of mRNAs having intact 5' ends and the ligation 
15 of the phosphate present at the 5' end of the decapped mRNA to an oUgonucleotide tag. Further 
detail regarding the enzymatic approaches for obtaining mRNAs having intact 5' ends are 
disclosed in Dumas Milne Edwards J.B. (Doctoral Thesis of Paris VI University, Le clonage des 
ADNc complets: difficultes et perspectives nouvelles. Apports pour Fetude de la regulation de 
I'expression de la tryptophane hydroxylase de rat, 20 Dec. 1993), EPO 625572 and Kato etal, 
2 0 Gene 150:243-250 (1994). 

In both the chemical and the enzymatic approach, the oligonucleotide tag has a 
restriction enzyme site (e.g. an EcoRI site) therein to facilitate later cloning procedures. 
Following attachment of the oligonucleotide tag to the mRNA, the integrity of the mRNA is 
examined by performing a Northern blot using a probe complementary to the oligonucleotide 

2 5 tag. 

For the mRNAs joined to oligonucleotide tags using either the chemical or the enzymatic 
method, first strand cDNA synthesis is performed using an oligo-dT primer with reverse 
transcriptase. This oligo-dT primer can contain an internal tag of at least 4 nucleotides, which 
can be different from one mRNA preparation to another. Methylated dCTP is used for cDNA 

3 0 first strand synthesis to protect the intemal EcoRI sites from digestion during subsequent steps. 

The first strand cDNA is precipitated using isopropanol after removal of RNA by alkaline 
hydrolysis to eliminate residual primers. 

Second strand cDNA synthesis is conducted using a DNA polymerase, such as Klenow 
fragment and a primer corresponding to the 5' end of the ligated oligonucleotide. The primer is 
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typically 20-25 bases in length. Methylated dCTP is used for second strand synthesis in order to 
protect internal EcoRI sites in the cDNA from digestion during the cloning process. 

Following second strand synthesis, the full-length cDNAs are cloned into a phagemid 
vector, such as pBlueScript™ (Stratagene), The ends of the full-length cDNAs are blunted with 
5 T4 DNA polymerase (Biolabs) and the cDNA is digested with EcoRI. Since methylated dCTP 
is used during cDNA synthesis, the EcoRI site present in the tag is the only hemi-methylated 
site; hence the only site susceptible to EcoRI digestion. In some instances, to facilitate 
subcloning, an Hind III adapter is added to the 3' end of full-length cDNAs. 

The full-length cDNAs are then size fractionated using either exclusion chromatography 
1 0 (AcA, Biosepra) or electrophoretic separation which yields 3 to 6 different fractions. The full- 
length cDNAs are then directionally cloned either into pBlueScript™ using either the EcoRI and 
Smal restriction sites or, when the Hind III adapter is present in the full-length cDNAs, the 
EcoRI and Hind III restriction sites. The ligation mixture is transformed, preferably by 
y electroporation, into bacteria, which are then propagated under appropriate antibiotic selection. 

;;;y 1 5 Clones containing the oligonucleotide tag attached to full-length cDNAs are selected as 

follows. 

The plasmid cDNA libraries made as described above are purified (e.g. by a column 
g available from Qiagen). A positive selection of the tagged clones is performed as follows. 

Briefly, in this selection procedure, the plasmid DNA is converted to single stranded DNA using 
flJ 2 0 phage Fl gene II endonuclease in combination with an exonuclease (Chang et al., Gene 127 :95 
fi: (1993)) such as exonuclease III or T7 gene 6 exonuclease. The resulting single stranded DNA is 

then purified using paramagnetic beads as described by Fry et al., Biotechniques 13: 124 (1992). 

Here the single stranded DNA is hybridized with a biotinylated oligonucleotide having a 

sequence corresponding to the 3' end of the oligonucleotide tag. Preferably, the primer has a 

2 5 length of 20-25 bases. Clones including a sequence complementary to the biotinylated 

oligonucleotide are selected by incubation with streptavidin coated magnetic beads followed by 
magnetic capture. After capture of the positive clones, the plasmid DNA is released from the 
magnetic beads and converted into double stranded DNA using a DNA pol}mierase such as 
ThermoSequenase™ (obtained from Amersham Pharmacia Biotech). Altematively, protocols 

3 0 such as the Gene Trapper^"^ kit (Gibco BRL) can be used. The double stranded DNA is then 

transformed, preferably by electroporation, into bacteria. The percentage of positive clones 
having the 5' tag oligonucleotide is typically estimated to be between 90 and 98% from dot blot 
analysis. 
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Following transformation, the libraries are ordered in microtiter plates and sequenced. 
The Arabidopsis library was deposited at the American Type Culture Collection on January 
7, 2000 as E-coli liba 010600" under the accession number PTA-1161 . 
EXAMPLE 2: SOUTHERN HYBRIDIZATIONS 
5 The SDFs of the invention can be used in Southern hybridizations as described above. 

The following describes extraction of DNA from nuclei of plant cells, digestion of the 
nuclear DNA and separation by length, transfer of the separated fragments to membranes, 
preparation of probes for hybridization, hybridization and detection of the hybridized probe. 
The procedures described herein can be used to isolate related polynucleotides or for 

1 0 diagnostic purposes. Moderate stringency hybridization conditions, as defined above, are 
described in the present example. These conditions result in detection of hybridization 
between sequences having at least 70% sequence identity. As described above, the 
hybridization and wash conditions can be changed to reflect the desired percenatge of 
sequence identity between probe and target sequences that can be detected. 

15 In the following procedure, a probe for hybridization is produced from two PGR 

reactions using two primers from genomic sequence of Arabidopsis thaliana. As described 
above, the particular template for generating the probe can be any desired template. 

The first PCR product is assessed to validate the size of the primer to assure it is of 
the expected size. Then the product of the first PCR is used as a template, with the same pair 

2 0 of primers used in the first PCR, in a second PCR that produces a labeled product used as the 
probe. 

Fragments detected by hybridization, or other bands of interest, can be isolated from 
gels used to separate genomic DNA fragments by known methods for further purification 
and/or characterization. 

2 5 Buffers for nuclear DNA extraction 

1. lOXHB 





1000 ml 




40 mM spermidine 


10.2 g 


Spermine (Sigma S-2876) and spermidine (Sigma 
S-2501) 


10 mM spermine 


3.5 g 


Stabilize chromatin and the nuclear membrane 
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0.1 M EDTA 
(disodium) 


37.2 g 


EDTA inhibits nuclease 


0.1 M Tris 


12.1 g 


Buffer 


0.8 M KCl 


59.6 g 


Adjusts ionic strength for stability of nuclei 



Adjust pH to 9.5 with 10 N NaOH. It appears that there is a nuclease present in 
leaves. Use of pH 9.5 appears to inactivate this nuclease. 

2. 2 M sucrose (684 g per 1000 ml) 

Heat about half the final volume of water to about 50°C, Add the sucrose slowly then 
bring the mixture to close to final volume; stir constantly until it has dissolved. Bring 
the solution to volume. 

3. Sarkosyl solution (lyses nuclear membranes) 

1000 ml 

N-lauroyl sarcosine (Sarkosyl) 20.0 g 

0.1 M Tris 12.1 g 

0.04 M EDTA (Disodium) 14.9 g 

Adjust the pH to 9.5 after all the components are dissolved and bring up to the proper 
volume. 

4. 20% Triton X-100 
80 ml Triton X-100 

320 ml IxHB (w/o p-ME and PMSF) 

Prepare in advance; Triton takes some time to dissolve 



A. Procedure 

1. Prepare IX H" buffer (keep ice-cold during use) 
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1000 ml 



lOXHB 



100 ml 



2 M sucrose 



250 ml a non-ionic osraoticum 



Water 



634 ml 



Added just before use: 



100 mM PMSF* 



10 ml a protease inhibitor; protects 

nuclear membrane proteins 

1 ml inactivates nuclease by reducing 



B-mercaptoethanol 



disulfide bonds 



If 10 

2. 

riJ 15 

3. 
4, 

20 



*100 mM PMSF 

(phenyl methyl sulfonyl fluoride, Sigma P-7626) 
(add 0.0875 g to 5 ml 100% ethanol) 

Homogenize the tissue in a blender (use 300-400 ml of IxHB per blender). Be sure 
that you use 5-10 ml of HB buffer per gram of tissue. Blenders generate heat so be 
sure to keep the homogenate cold. It is necessary to put the blenders in ice 
periodically. 

Add the 20% Triton X-100 (25 ml per liter of homogenate) and gently stir on ice for 
20 min. This lyses plastid, but not nuclear, membranes. 

Filter the tissue suspension through several nylon filters into an ice-cold beaker. The 
first filtration is through a 250-micron membrane; the second is through an 85-micron 
membrane; the third is through a 50-micron membrane; and the fourth is through a 
20-micron membrane. Use a large funnel to hold the filters. Filtration can be sped up 
by gently squeezing the liquid through the filters. 

Centrifuge the filtrate at 1200 x g for 20 min. at 4°C to pellet the nuclei. 
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6. Discard the dark green supernatant. The pellet will have several layers to it. One is 
starch; it is white and gritty. The nuclei are gray and soft. In the early steps^ there 
may be a dark green and somewhat viscous layer of chloroplasts. 

Wash the pellets in about 25 ml cold H buffer (with Triton X-100) and resuspend by 
5 swirling gently and pipetting. After the pellets are resuspended. 

Pellet the nuclei again at 1200 - 1300 x g. Discard the supernatant. 

Repeat the wash 3-4 times until the supernatant has changed from a dark green to a 
pale green. This usually happens after 3 or 4 resuspensions. At this point, the pellet 
is typically grayish white and very slippery. The Triton X-100 in these repeated steps 
helps to destroy the chloroplasts and mitochondria that contaminate the prep. 

Resuspend the nuclei for a final time in a total of 15 ml of H buffer and transfer the 
suspension to a sterile 125 ml Erlenmeyer flask. 

7. Add 15 ml, dropwise, cold 2% Sarkosyl, 0.1 M Tris, 0.04 M EDTA solution (pH 9.5) 
while swirling gently. This lyses the nuclei. The solution will become very viscous, 

8. Add 30 grams of CsCl and gently swirl at room temperature until the CsCl is in 
solution. The mixture will be gray, white and viscous. 

9. Centrifuge the solution at 11,400 x g at 4^C for at least 30 min. The longer this spin 
is, the firmer the protein pellicle. 

2 0 10. The result is typically a clear green supernatant over a white pellet, and (perhaps) 

under a protein pellicle. Carefully remove the solution under the protein pellicle and 
above the pellet. Determine the density of the solution by weighing 1 ml of solution 
and add CsCl if necessary to bring to 1.57 g/ml. The solution contains dissolved 
solids (sucrose etc) and the refractive index alone will not be an accurate guide to 
2 5 CsCl concentration. 
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11. Add 20 )al of 10 mg/ml EtBr per ml of solution. 

12. Centrifuge at 184,000 x g for 16 to 20 hours in a fixed-angle rotor, 

13. Remove the dark red supernatant that is at the top of the tube with a plastic transfer 
pipette and discard. Carefully remove the DNA band with another transfer pipette. 
The DNA band is usually visible in room light; otherwise, use a long wave UV light 
to locate the band. 

14. Extract the ethidium bromide with isopropanol saturated with water and salt. Once 
the solution is clear, extract at least two more times to ensure that all of the EtBr is 
gone. Be very gentle, as it is very easy to shear the DNA at this step. This extraction 
may take a while because the DNA solution tends to be very viscous. If the solution 
is too viscous, dilute it with TE, 

15. Dialyze the DNA for at least two days against several changes (at least three times) of 
TE (10 mM Tris, ImM EDTA, pH 8) to remove the cesium chloride. 

16. Remove the dialyzed DNA from the tubing. If the dialyzed DNA solution contains a 
lot of debris, centrifuge the DNA solution at least at 2500 x g for 10 min. and 
carefully transfer the clear supernatant to a new tube. Read the A260 concentration of 
the DNA. 

17. Assess the quality of the DNA by agarose gel electrophoresis (1% agarose gel) of the 
DNA. Load 50 ng and 100 ng (based on the OD reading) and compare it with known 
and good quality DNA. Undigested lambda DNA and a lambda-Hindlll-digested 
DNA are good molecular weight makers. 

Protocol for Digestion of Genomic DNA 

Protocol : 

1. The relative amounts of DNA for different crop plants that provide approximately a 
balanced number of genome equivalent is given in Table 3. Note that due to the size 
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of the wheat genome, wheat DNA will be underrepresented. Lambda DNA provides 
a useful control for complete digestion. 



2. Precipitate the DNA by adding 3 volumes of 100% ethanoL Incubate at -20°C for at 
least two hours. Yeast DNA can be purchased and made up at the necessary 
5 concentration, therefore no precipitation is necessary for yeast DNA. 



3. Centrifuge the solution at 11,400 x g for 20 min. Decant the ethanol carefully (be 
careful not to disturb the pellet). Be sure that the residual ethanol is completely 
removed either by vacuum desiccation or by carefully wiping the sides of the tubes 
with a clean tissue. 



10 4. Resuspend the pellet in an appropriate volume of water. Be sure the pellet is fully 

resuspended before proceeding to the next step. This may take about 30 min, 

5. Add the appropriate volume of lOX reaction buffer provided by the manufacturer of 
the restriction enzyme to the resuspended DNA followed by the appropriate volume 
of enzymes. Be sure to mix it properly by slowly swirling the tubes. 

15 6. Set-up the lambda digestion-control for each DNA that you are digesting. 

7. Incubate both the experimental and lambda digests overnight at 37 C. Spin down 
condensation in a microfuge before proceeding. 

8. After digestion, add 2 jal of loading dye (typically 0.25% bromophenol blue, 0.25% 
xylene cyanol in 15% FicoU or 30% glycerol) to the lambda-control digests and load 

20 in 1% TPE-agarose gel (TPE is 90 mM Tris-phosphate, 2 mM EDTA, pH 8). If the 

lambda DNA in the lambda control digests are completely digested, proceed with the 
precipitation of the genomic DNA in the digests. 



9 



Precipitate the digested DNA by adding 3 volumes of 100% ethanol and incubating in 
-20^C for at least 2 hours (preferably overnight). 
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EXCEPTION: Arabidopsis and yeast DNA are digested in an appropriate volume; 

they don't have to be precipitated. 

10. Resuspend the DNA in an appropriate volume of TE (e.g., 22 |al x 50 blots = 1100 |al) 
and an appropriate volume of lOX loading dye (e.g., 2.4 |iil x 50 blots = 120 Be 
careful in pipetting the loading dye - it is viscous. Be sure you are pipetting the 
correct volume. 

Table 3 



Some guide points in digesting genomic DNA. 



Species 


Genome 
Size 


lSizp Tiplfitivp to 

Arabidopsis 


Genome 

iicr A t^fftWirlrhncf G 

DNA 


Amount 
rtpr hint 


Arabidopsis 


120 Mb 


IX 






Brassica 


1,100 Mb 


9.2X 


0.54X 




Com 


2,800 Mb 


23. 3X 


0.43X 


20 


Cotton 


2,300 Mb 


1 ^^.2'^?^ 


0.52X 


20 ng 


Oat 


11,300 Mb 


94X 


O.llX 


20 ng 


Rice 


400 Mb 


3.3X 


0.75X 


5 Mg 


Soybean 


1,100 Mb 


9.2X 


0.54X 


10 lag 


Sugarbeet 


758 Mb 


6.3X 


0.8X 


10 Mg 


Sweetclover 


1,100 Mb 


9.2X 


0.54X 


10 ^g 


Wheat 


16,000 Mb 


133X 


0.08X 


20 \x.g 


Yeast 


15 Mb 


0.12X 


IX 


0.25 (ig 



Protocol for Southern Blot Analysis 

The digested DNA samples are electrophoresed in 1% agarose gels in Ix TPE buffer. 
Low voltage; overnight separations are preferred. The gels are stained with EtBr and 
photographed. 
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1. For blotting the gels^ first incubate the gel in 0,25 N HCl (with gentle shaking) for 
about 15 min. 

2. Then briefly rinse with water. The DNA is denatured by 2 incubations. Incubate 
(with shaking) in 0.5 M NaOH in 1.5 M NaCl for 15 min. 

3. The gel is then briefly rinsed in water and neutralized by incubating twice (with 
shaking) in 1.5 M Tris pH 7.5 in 1.5 M NaCl for 15 min. 

4. A nylon membrane is prepared by soaking it in water for at least 5 min, then in 6X 
SSC for at least 15 min. before use. (20x SSC is 175.3 g NaCl, 88.2 g sodium citrate 
per liter, adjusted to pH 7.0.) 

5. The nylon membrane is placed on top of the gel and all bubbles in between are 
removed. The DNA is blotted from the gel to the membrane using an absorbent 
medium, such as paper toweling and 6x SCC buffer. After the transfer, the membrane 
may be lightly brushed with a gloved hand to remove any agarose sticking to the 
surface. 

6. The DNA is then fixed to the membrane by UV crosslinking and baking at SO^'C. The 
membrane is stored at 4^C until use. 

B. Protocol for PCR Amplification of Genomic Fragments in Arabidopsis 



Amplification procedures : 

1 . Mix the following in a 0.20 ml PCR tube or 96-well PCR plate: 



Volume 


Stock 


Final Amount or Cone* 


0.5 |ul 


- 10 ng/|al genomic DNA^ 


5 ng 


2.5 III 


lOX PCR buffer 


20 mM Tris, 50 mM KCl 



^ Arabidopsis DNA is used in the present experiment, but the procedure is a general one. 
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0.75 111 


50 mM MgCl2 


1.5 mM 


1 ^1 


10 pmol/|j.l Primer 1 (Forward) 


10 pmol 


1 1^1 


10 pmol/^1 Primer 2 (Reverse) 


10 pmol 


0.5 fal 


5 mM dNTPs 


0.1 mM 


0.1 111 


5 units/ial Platinum Taq™ (Life 
Technologies, Gaithersburg, MD) 
DNA Polymerase 


1 imits 


(to 25 ^l) 


Water 





2. The template DNA is amplified using a Perkin Elmer 9700 PCR machine: 



1) 94°C for 10 min. followed by 



2) 

5 cycles: 


3) 

5 cycles: 


4) 

25 cycles: 


94 °C - 30 sec 
62 ''C- 30 sec 
72 °C- 3 min 


94 °C- 30 sec 
58 "C - 30 sec 
72 °C - 3 min 


94 °C - 30 sec 
53 °C- 30 sec 
72 °C - 3 min 



5) 72°C for 7 min. Then the reactions are stopped by chilling to 4°C. 
The procedure can be adapted to a multi-well format if necessary. 
Quantification and Dilution of PCR Products: 

1. The product of the PCR is analyzed by electrophoresis in a 1% agarose gel. A 

linearized plasmid DNA can be used as a quantification standard (usually at 50, 100, 
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200, and 400 ng). These will be used as references to approximate the amount of 
PGR products, Hindlll-digested Lambda DNA is useful as a molecular weight 
marker. The gel can be run fairly quickly; e.g., at 100 volts. The standard gel is 
examined to determine that the size of the PGR products is consistent with the 
expected size and if there are significant extra bands or smeary products in the PGR 
reactions. 

2. The amounts of PGR products can be estimated on the basis of the plasmid standard. 

3. For the small number of reactions that produce extraneous bands, a small amount of 
DNA from bands with the correct size can be isolated by dipping a sterile 10-|al tip 
into the band while viewing though a UV Transilluminator, The small amount of 
agarose gel (with the DNA fragment) is used in the labeling reaction. 

C. Protocol for PCR-DIG-Labeling of DNA 

Solutions : 

Reagents in PGR reactions (diluted PGR products, lOX PGR Buffer, 50 mM MgGb, 5 
U/|al Platinum Taq Polymerase, and the primers) 

lOX dNTP + DIG ll-dUTP [1:5]: (2 mM dATP, 2 mM dCTP, 2 mM dOTP, 1,65 
mM dTTP, 0.35 mM DIG-ll-dUTP) 

lOX dNTP H- DIG-ll-dUTP [1:10]: (2 mM dATP, 2 mM dGTP, 2 mM dGTP, 1.81 
mM dTTP, 0.19 mM DIG-ll-dUTP) 

lOX dNTP + DIG-ll-dUTP [1:15]: (2 mM dATP, 2 mM dGTP, 2 mM dGTP, 1.875 
mM dTTP, 0.125 mM DIG-ll-dUTP) 

TE buffer (10 mM Tris, 1 mM EDTA, pH 8) 

Maleate buffer: In 700 ml of deionized distilled water, dissolve 1T61 g maleic acid 
and 8.77 g NaGl. Add NaOH to adjust the pH to 7.5. Bring the volume to 1 L. Stir 
for 15 min. and sterilize. 
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10% blocking solution: In 80 ml deionized distilled water, dissolve 1.16g maleic 
acid. Next, add NaOH to adjust the pH to 7.5. Add 10 g of the blocking reagent 
powder (Boehringer Mannheim, Indianapolis, IN, Cat. no. 1096176). Heat to 60°C 
while stirring to dissolve the powder. Adjust the volume to 100 ml with water. Stir 
5 and sterilize. 

1% blocking solution: Dilute the 10% stock to 1% using the maleate buffer. 

Buffer 3 (100 mM Tris, 100 mM NaCl, 50 mM MgCls, pH9.5). Prepared from 
autoclaved solutions of IM Tris pH 9.5, 5 M NaCl, and 1 M MgCl2 in autoclaved 
distilled water. 
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Procedure: 



1. PGR reactions are performed in 25 volumes containing; 



PGR buffer 
MgGl2 

lOX dNTP + DIG-ll-dUTP 
Platinum Taq'^'*' Polymerase 
10 pg probe DNA 
10 pmol primer 1 



IX 

1.5 mM 

IX (please see the note below) 
1 unit 



Note: 



1 0X dNTP + DTG-1 1 -dUTP (1:5) 



lOX dNTP + DIG-ll-dUTP (1:10) 
lOX dNTP + DIG-ll-dUTP (1:15) 



Use for : 
< 1 kb 



1 kb to 1.8 kb 
> 1.8 kb 



2. The PGR reaction uses the following amplification cycles: 



1) 


94°G for 10 min. 


















5 cycles: 




5 cycles: 




25 cycles: 


95°G - 


30 sec 


95°G - 


30 sec 


95°G -30 sec 


ere - 


1 min 


59°G - 


1 min 


51°G - 1 min 


73°C - 


5 min 


75 ''C - 


5 min 


TS^G - 5 min 



5) ll^'C for 8 min. The reactions are terminated by chilling to 4°C (hold). 

3. The products are analyzed by electrophoresis- in a 1% agarose gel, comparing to an 
aliquot of the unlabelled probe starting material. 



4. The amount of DIG-labeled probe is determined as follows: 



♦ 
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Make serial dilutions of the diluted control DNA in dilution buffer (TE: 10 mM Tris 
and 1 mM EDTA, pH 8) as shown in the following table: 



DIG-labeled control 
DNA starting cone 


Stepwise Dilution 


Final Cone. (Dilution 
Name) 


5 ng/nl 


1 ix\ in 49 111 TE 


100 pg/nl (A) 


100 (A) 


25 in 25 jj-l TE 


50 pg/^1 (B) 


50 pg/^il (B) 


25 \<lI in 25 jal TE 


25 pg/yl (C) 


25 pg/nl (C) 


20 |Lil in 30 fal TE 


10 pg/^1 (D) 



a. Serial deletions of a DIG-labeled standard DNA ranging from 100 pg to 10 pg 
are spotted onto a positively charged nylon membrane, marking the membrane 

5 lightly with a pencil to identify each dilution. 

b. Serial dilutions (e.g., 1:50, 1:2500, 1:10,000) of the newly labeled DNA probe 
are spotted. 



c. The membrane is fixed by UV crosslinking. 

d. The membrane is wetted with a small amount of maleate buffer and then 
1 0 incubated in 1% blocking solution for 15 min at room temp. 

e. The labeled DNA is then detected using alkaline phosphatase conjugated anti- 
DIG antibody (Boehringer Mannheim, Indianapolis, IN, cat. no. 1093274) and 
an NBT substrate according to the manufacture's instruction. 



1 

15 



Spot intensities of the control and experimental dilutions are then compared to 
estimate the concentration of the PCR-DIG-labeled probe. 
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D. Prehybridization and Hybridization of Southern Blots 

Solutions : 

100% Formamide purchased from Gibco 

20X SSC (IX = 0.15 M NaCl, 0.015 M Nascitrate) 

per L: 175 g NaCl 

87.5gNa3citrate*2H20 

20% Sarkosyl (N-lauroyl-sarcosine) 

20% SDS (sodium dodecyl sulphate) 

10% Blocking Reagent: In 80 ml deionized distilled water, dissolve 1.16 g maleic 

acid. Next, add NaOH to adjust the pH to 7.5. Add 10 g of the blocking reagent 
powder. Heat to 6(FC while stirring to dissolve the powder. Adjust the volume 
to 100 ml with water. Stir and sterilize. 



Prehybridization Mix: 



Final 

Concentration 


Components 


Volume 
(per 100 ml) 


Stock 


50% 


Formamide 


50 ml 


100% 


5X 


SSC 


25 ml 


20X 


0.1% 


Sarkosyl 


0.5 ml 


20% 


0.02% 


SDS 


0.1 ml 


20% 


2% 


Blocking Reagent 


20 ml 


10% 




Water 


4.4 ml 





General Procedures : 

1. Place the blot in a heat-sealable plastic bag and add an appropriate volume of 

prehybridization solution (30 ml/lOOcm^) at room temperature. Seal the bag with a 
heat sealer, avoiding bubbles as much as possible. Lay down the bags in a large 
plastic tray (one tray can accommodate at least 4-5 bags). Ensure that the bags are 
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lying flat in the tray so that the prehybridization solution is evenly distributed 
throughout the bag. Incubate the blot for at least 2 hours with gentle agitation using a 
waver shaker. 

2. Denature DIG-labeled DNA probe by incubating for 10 min. at 98''C using the PGR 
5 machine and immediately cool it to 4°C, 

3. Add probe to prehybridization solution (25 ng/ml; 30 ml = 750 ng total probe) and 
mix well but avoid foaming. Bubbles may lead to background. 

4. Pour off the prehybridization solution from the hybridization bags and add new 
prehybridization and probe solution mixture to the bags containing the membrane. 

5. Incubate with gentle agitation for at least 16 hours. 

6. Proceed to medium stringency post-hybridization wash: 

Three times for 20 min. each with gentle agitation using IX SSC, 1% SDS at 60°C. 

All wash solutions must be prewarmed to 60*'C. Use about 100 ml of wash solution 
per membrane. 

To avoid background keep the membranes fully submerged to avoid drying in spots; 
agitate sufficiently to avoid having membranes stick to one another, 

7. After the wash, proceed to immunological detection and CSPD development. 

E. Procedure for Immunological Detection with CSPD 

Solutions : 

2 0 Buffer 1: Maleic acid buffer (0.1 M maleic acid, 0.15 M NaCl; 

adjusted to pH 7.5 with NaoH) 




Washing buffer: 



Maleic acid buffer with 0.3% (v/v) Tween 20 
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Blocking stock solution 



10% blocking reagent in buffer 1. Dissolve (lOX 
concentration): blocking reagent powder (Boehringer 
Mannheim, Indianapolis, IN, cat, no, 1096176) by 
constantly stirring on a 65°C heating block or heat in a 
microwave, autoclave and store at 4''C. 



Buffer 2 

(IX blocking solution): 



Dilute the stock solution 1:10 in Buffer 1 



Detection buffer: 



0.1 M Tris, 0.1 M NaCl, pH 9,5 



Procedure : 

10 1. After the post-hybridization wash the blots are briefly rinsed (1-5 min.) in the maleate 

washing buffer with gentle shaking. 

2, Then the membranes are incubated for 30 min. in Buffer 2 with gentle shaking. 



3, 



15 



Anti-DIG-AP conjugate (Boehringer Mannheim, Indianapolis, IN, cat. no. 1093274) 
at 75 mU/ml (1:10,000) in Buffer 2 is used for detection. 75 ml of solution can be 
used for 3 blots. 



4. The membrane is incubated for 30 min. in the antibody solution with gentle shaking. 



5. 



6. 



The membrane are washed twice in washing buffer with gentle shaking. About 250 
mis is used per wash for 3 blots. 

The blots are equilibrated for 2-5 min in 60 ml detection buffer. 



2 0 7. Dilute CSPD (1:200) in detection buffer. (This can be prepared ahead of time and 

stored in the dark at 4°C). 



The following steps must be done individually. Bags (one for detection and one for 
exposure) are generally cut and ready before doing the following steps. 
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8. The blot is carefully removed from the detection buffer and excess liquid removed 

vv^ithout drying the membrane. The blot is immediately placed in a bag and 1.5 ml of 
CSPD solution is added. The CSPD solution can be spread over the membrane. 
Bubbles present at the edge and on the surface of the blot are typically removed by 
gentle rubbing. The membrane is incubated for 5 min. in CSPD solution. 



9. Excess liquid is removed and the membrane is blotted briefly (DNA side up) on 
Whatman 3MM paper. Do not let the membrane dry completely. 

10. Seal the damp membrane in a hybridization bag and incubate for 10 min at 31^0 to 
enhance the luminescent reaction. 



10 11. Expose for 2 hours at room temperature to X-ray film. Multiple exposures can be 

taken. Luminescence continues for at least 24 hours and signal intensity increases 
during the first hours. 



Example 3: Transformation of Carrot Cells 

Transformation of plant cells can be accomplished by a number of methods, as 
1 5 described above. Similarly, a number of plant genera can be regenerated from tissue culture 
following transformation. Transformation and regeneration of carrot cells as described herein 
is illustrative. 

Single cell suspension cultures of carrot (Daucus carota) cells are established from 
hypocotyls of cultivar Early Nantes in B5 growth medium (O.L. Gamborg et al.. Plant 

2 0 Physiol 45:372 (1970)) plus 2,4-D and 15 mM CaCl2 (B5 -44 medium) by methods known in 
the art. The suspension cultures are subcultured by adding 10 ml of the suspension culture to 
40 ml of B5-44 medium in 250 ml flasks every 7 days and are maintained in a shaker at 150 
rpm at 27 ""C in the dark. 

The suspension culture cells are transformed with exogenous DNA as described by Z. 

25 Chen et al. Plant MoL Bio. 36:163 (1998). Briefly, 4-days post-subculture cells are incubated 
with cell wall digestion solution containing 0.4 M sorbitol, 2% driselase, 5mM MES (2-[N- 
Morpholino] ethanesulfonic acid) pH 5.0 for 5 hours. The digested cells are pelleted gendy 
at 60 xg for 5 min. and washed twice in W5 solution containing 154 mM NaCl, 5 mM KCl, 
125 mM CaCl2 and 5mM glucose, pH 6.0. The protoplasts are suspended in MC solution 
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containing 5 mM MES, 20 mM CaCb, 0.5 M mannitol, pH 5.7 and the protoplast density is 
adjusted to about 4 x 10^ protoplasts per ml. 

15-60 jag of plasmid DNA is mixed with 0.9 ml of protoplasts. The resulting 
suspension is mixed with 40% polyethylene glycol (MW 8000, PEG 8000), by gentle 
5 inversion a few times at room temperature for 5 to 25 min. Protoplast culture medium known 
in the art is added into the PEG-DNA-protoplast mixture. Protoplasts are incubated in the 
culture medium for 24 hour to 5 days and cell extracts can be used for assay of transient 
expression of the introduced gene. Alternatively, transformed cells can be used to produce 
transgenic callus, which in tum can be used to produce transgenic plants, by methods known 
10 in the art. See, for example, Nomura and Komamine, Pit. Phys, 79:988-991 (1985), 
Identification and Isolation of Single Cells that Produce Somatic Embryos in Carrot 
Suspension Cultures, 

An additional deposit, PTA-1411, of an£. coli Library, E. c(?/?LibA021800, was 
'■Q made at the American Type Culture Collection in Manassas, Virginia, USA on February 22, 

1 5 2000 to meet the requirements of Budapest Treaty for the international recognition of the 
flJ deposit of microorganisms. This deposit was assigned ATCC accession no. PTA-1411. 

%1 The invention being thus described, it will be apparent to one of ordinary skill in the 

' art that various modifications of the materials and methods for practicing the invention can be 

I made. Such modifications are to be considered within the scope of the invention as defined 
fsi 20 by the following claims. 

C'l Each of the references from the patent and periodical literature cited herein is hereby 

f ] expressly incorporated in its entirety by such citation. 



Attorney No. 2750-1300P 



Table 1 



95 





>6041783 


/ 








5 


len = 


103321 


nex = 


192 






>6041783 


/12712 








Init 


15380 


15382 


+ 


9 


1 n 

± u 


inn 2_ 


15661 


15750 


-L 






Intr 


15863 


15991 


+ 


9 




1 eirrri 


16486 


16644 


-I- 


Q 




>6041783 


724664 






15 














Term 


17621 


17354 


— 


11 




Intr 


17939 


17689 


— 


11 




Init 


18573 


18097 


— 


11 


20 


>6041783 


/113989 








Init 


26277 


26522 




15 




Intr 


26631 


26697 


-r 


J. o 




Intr 


26802 


26841 


+ 


15 


51 25 


Intr 


26982 


27114 


+ 


15 




Intr 


27199 


27270 


+ 


15 




Intr 


27353 


27433 


+ 


15 


'■J 


Intr 


27558 


27619 


+ 


15 




Term 


27709 


27781 


+ 


15 


J O U 














>6041783 


71301 








Term 


34670 


34479 


— 


17 




Intr 


34970 


34747 


— 


17 




Intr 


35187 


35070 




1 n 




Intr 


35574 


35281 


— 


17 


'■'■'■^ 


Intr 


35835 


35790 




1 7 




Init 


36096 


35849 


— 


17 


40 


>6041783 


71301 








Term 


36133 


36118 


— 


18 




Init 


36852 


36719 


— 


18 


45 


>6041783 


7121720 








Term 


52226 


52150 


— 


23 




Intr 


53606 


52301 


— 


23 




Intr 


53901 


53717 


— 


23 


50 


Intr 


54183 


54032 


— 


23 




Intr 


54691 


54657 


— 


23 




Intr 


54914 


54798 




23 




Intr 


56977 


54971 




23 




Intr 


57259 


57076 




23 


55 


Intr 


57931 


57612 




23 




Intr 


58219 


58156 




23 




Intr 


58521 


58313 




23 




Intr 


58660 


58607 




23 




Intr 


58889 


58734 




23 


60 


Intr 


59207 


58992 




23 



* 

Attorney No. 2750-1300P 



96 





Intr 


59665 


59448 


— 


23 




Intr 


59827 


59761 


— 


23 




Intr 


60164 


60031 


— 


23 




Intr 


60516 


60417 


— 


23 


5 


Intr 


60778 


60619 


— 


23 




Intr 


61207 


61141 


— 


23 




Intr 


61465 


61429 


— 


23 




Intr 


61788 


61740 


— 


23 




Intr 


62242 


61975 


— 


23 


10 


Intr 


62471 


62330 


— 


23 




Intr 


62829 


62647 


— 


23 




Intr 


63035 


62931 


— 


23 




Intr 


63206 


63192 


— 


23 




Intr 


63493 


63251 


— 


23 


15 


Intr 


64223 


63582 


— 


23 




Intr 


64869 


64746 


— 


23 




Intr 


65397 


65397 


— 


23 




Intr 


65485 


65455 




23 




Intr 


65692 


65641 




23 


20 


Intr 


65750 


65750 




23 




T n 1~ T 


65965 


65809 




23 




Intr 


6119Q 


67643 


— 


23 




I nt r 


67856 


67824 




23 




Intr 


68111 


67974 


— 


23 


25 


Intr 


68393 


68190 


— 


23 




Init 


68775 


68722 




23 




>6041783 


/114015 






30 


Term 


69666 


69650 


— 


24 




Intr 


70086 


70010 


— 


24 




X 1 1 ' L 


70374 


70295 




24 




Intr 


70829 


70662 


— 


24 




Intr 


71364 


71291 


— 


24 


35 


Intr 


71436 


71394 


— 


24 




Intr 


71670 


71633 


— 


24 




Init 


72191 


71987 


— 


24 




>6041783 


/37037 




















Term 


76407 


76299 


— 


26 






76936 


76672 




26 




Intr 


77117 


77038 


— 


26 




Intr 


77547 


77405 


— 


26 


45 


Init 


78190 


78116 


— 


26 




>6041783 


/40253 








Init 


79129 


79352 


+ 


29 


50 


Intr 


79667 


79943 


+ 


29 




~r "H V" 
J. 1 1 L -L 


80037 


80126 


L 


2 9 




Intr 


80348 


80419 


+ 


29 




X i i 1 L 


80499 


80576 


+ 


29 




Intr 


80671 


80736 


+ 


29 


55 


Intr 


80819 


80900 


+ 


29 




Term 


80989 


81278 


+ 


29 




>6041783 




/149026 






60 


Init 


85797 


85832 


+ 


31 




Intr 


86388 


86643 


+ 


31 



Attorney No. 2750-1300P 



97 





Intr 


86875 


87020 




31 




Term 


87074 


87247 




31 


r" 

5 


>6087918 


/ 










len = 


108661 nex 




172 




>6087918 


727793 






10 














Term 


20251 


20165 




7 




T "h 


20464 


20335 




7 




Intr 


20953 


20592 


— 


7 




Intr 


21185 


21090 


— 


7 


15 


Intr 


21338 


21269 


— 


7 




Intr 


21685 


21600 


— 


7 




Init 


21960 


21856 


— 


7 




>6087918 


/11776 






20 














Term 


71070 


70937 


— 


22 




T n 1" 


71684 


71201 




22 




Intr 


74464 


74317 


— 


22 




Intr 


74885 


74524 


— 


22 


il 2 5 


Intr 


75312 


75213 


— 


22 




Intr 


75378 


75344 


— 


22 




Init 


75977 


75552 


— 


22 




>6087918 


/19561 




















1 erm 


82305 


81709 




iL. '-X 




Intr 


82711 


82406 


— 


24 




xni. r 


82806 


82747 




7 4 




Intr 


83037 


82936 


— 


24 




Intr 


83512 


83449 


— 


24 




Init 


84652 


83598 


— 


24 


;.r_ 
^ :!r 


>6087918 


/34371 






40 


Init 


89621 


90013 




27 




Intr 


90106 


90192 


+ 


27 




Intr 


90273 


90392 




27 




Intr 


90473 


90571 




27 




Intr 


90649 


90817 


+ 


27 


45 


Intr 


90909 


91002 


+ 


27 




Intr 


91126 


91219 




27 




Intr 


91325 


91381 


+ 


27 




Intr 


91780 


92193 


+ 


27 




Term 


92279 


92440 


1 

-r 


z / 


50 














>6087919 


/ 










len = 


102300 


nex ~ 


148 




55 














>6087919 


/31888 








Single 


19312 


19668 




9 


60 


>6087919 


76275 







Attorney No. 2750-1300P 



98 





Init 


26022 


26057 




12 




Intr 


27400 


27633 


+ 


12 




Term 


27790 


28866 


+ 


12 


5 














>6087919 


/11776 








Term 


61829 


61696 


— 


23 




Int r 


62443 


61960 




23 


10 


Intr 


64075 


63950 


— 


23 




Intr 


64735 


64636 


— 


23 




Intr 


64801 


64767 




23 




Init 


65400 


64975 


— 


23 


15 


>6087919 


/19561 








Term 


71728 


71132 


— 


25 




Intr 


72134 


71829 


— 


25 




Intr 


72229 


72170 


— 


25 


20 


Intr 


72460 


72359 


— 


25 




Intr 


72935 


72872 


— 


25 




Init 


74075 


73021 


— 


25 




>60879I9 


/34371 






25 














Init 


79043 


79435 


+ 


28 




Intr 


79528 


79614 


+ 


28 




Intr 


79695 


79814 


+ 


28 




Intr 


79895 


79993 




28 


30 


Intr 


80071 


80239 


+ 


28 




Intr 


80331 


80424 


+ 


28 




Intr 


80548 


80641 


+ 


28 




Intr 


80747 


80803 


+ 


28 




Intr 


81202 


81615 


+ 


28 


35 


Term 


81701 


81862 




28 



i 



Attorney No. 2750-1300P 



99 

CLAIMS 

What is claimed is: 



1. An isolated nucleic acid molecule comprising: 

a) a nucleic acid having a nucleotide sequence which encodes an amino acid sequence 
exhibiting at least 40% sequence identity to an amino acid sequence encoded by 

(1) a nucleotide sequence described in Tables 1 and/or 2 or a fragment thereof; or 

(2) a complement of a nucleotide sequence shown in Tables 1 and/or 2 or a fragment 
thereof; 

b) a nucleic acid which is the reverse of the nucleotide sequence according to 
subparagraph (a), such that the reverse nucleotide sequence has a sequence order 
which is the reverse of the sequence order of the nucleotide sequence according to 
subparagraph (a); 

c) a nucleic acid capable of hybridizing to a nucleic acid having a sequence selected 
from the group consisting of: 

(1) a nucleotide sequence which is shown in Tables 1 and/or 2; and 

(2) a nucleotide sequence which is complementary to a nucleotide sequence shown in 
Tables 1 and/or 2, under conditions that permit formation of a nucleic acid duplex 
at a temperature from about 40*^C and 48°C below the melting temperature of the 
nucleic acid duplex. 

2. An isolated nucleic acid molecule comprising a nucleic acid having a nucleotide sequence 
which exhibits at least 65% sequence identity to 

a) a nucleotide sequence shown in Tables 1 and/or 2 or a fragment thereof; or 

b) a complement of a nucleotide sequence described in Tables 1 and/or 2 or a fragment 
thereof. 

3. The nucleic acid molecule according to claim 1, wherein said nucleic acid comprises an 
open reading frame. 

4. The isolated nucleic acid molecule of claim 1, wherein said nucleic acid is capable of 
functioning as a promoter, a 3* end termination sequence, an untranslated region (UTR), 
or as a regulatory sequence. 

5. The isolated nucleic acid molecule of claim 4, wherein (a) when said nucleic acid is a 
promoter it comprises a sequence selected from the group consisting of a TATA box 
sequence, a CAAT box sequence, a motif of GCAATCG or any transcription-factor 
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binding sequence, and any combination thereof; and (b) when said nucleic acid sequence 
is a regulatory sequence it is capable of promoting seed-specific expression, embryo- 
specific expression, ovule-specific expression, tapetum-specific expression or root- 
specific expression of a sequence or any combination thereof. 

6. A vector construct comprising: 

a) a first nucleic acid having a regulatory sequence capable of causing transcription 
and/or translation; and 

b) a second nucleic acid having the sequence of the isolated nucleic acid molecule 
according to claim 1; 

wherein said first and second nucleic acids are operably linked and 

wherein said second nucleic acid is heterologous to any element in said vector construct. 

7. The vector construct according to claim 6, wherein said first nucleic acid is native to said 
second nucleic acid. 

8. The vector construct according to claim 6, wherein said first nucleic acid is heterologous 
to said second nucleic acid. 

9. A host cell comprising an isolated nucleic acid molecule according to claim 1, wherein 
said nucleic acid molecule is flanked by exogenous sequence. 

10. A host cell comprising a vector construct of claim 6. 

11. An isolated polypeptide comprising an amino acid sequence 

a) exhibiting at least 40%, or 75%, or 85%, or 90% sequence identity of an amino acid 
sequence encoded by a sequence shown in Tables 1 and/or 2 or a fragment thereof; 
and 

b) capable of exhibiting at least one of the biological activities of the polypeptide 
encoded by said nucleotide sequence shown in Tables 1 and/or 2 or a fragment 
thereof. 

12. An antibody capable of binding the isolated polypeptide of claim 11. 

13. A method of introducing an isolated nucleic acid into a host cell comprising: 

a) providing an isolated nucleic acid molecule according to claim 1; and 

b) contacting said isolated nucleic with said host cell under conditions that permit 
insertion of said nucleic acid into said host cell, 

14. A method of transforming a host cell which comprises contacting a host cell with a vector 
construct according to claim 6. 

15. A method of modulating transcription and/or translation of a nucleic acid in a host cell 
comprising: 
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a) providing the host cell of claim 9; and 

b) culturing said host cell under conditions that permit transcription or translation. 

16. A method for detecting a nucleic acid in a sample which comprises: 

a) providing an isolated nucleic acid molecule according to claim 1; 

b) contacting said isolated nucleic acid molecule with a sample under conditions which 
permit a comparison of the sequence of said isolated nucleic acid molecule with the 
sequence of DNA in said sample; and 

c) analyzing the result of said comparison. 

17. A plant or cell of a plant which comprises a nucleic acid molecule according to claim 1 
which is exogenous or heterologous to said plant or plant cell. 

18. A plant or cell of a plant which comprises a vector construct according to claim 6. 

19. A plant which has been regenerated from a plant cell according to claim 17. 

20. A plant which has been regenerated from a plant cell according to claim 18. 
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ABSTRACT OF THE DISCLOSURE 

The present invention provides DNA molecules that constitute fragments of the 
genome of a plant, and polypeptides encoded thereby. The DNA molecules are useful for 
specifying a gene product in cells, either as a promoter or as a protein coding sequence or as 
5 an UTR or as a 3' termination sequence, and are also useful in controlling the behavior of a 
gene in the chromosome, in controlling the expression of a gene or as tools for genetic 
mapping, recognizing or isolating identical or related DNA fragments, or identification of a 
particular individual organism, or for clustering of a group of organisms with a common trait. 
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